Single Nucleotide Polymorphisms sensitively predicting Adverse Drug Reactions
(ADR) and Drug Efficacy 

Technical Field

This invention relates to genetic polymorphisms useful for assessing cardiovascular
risks in humans, including, but not limited to, atherosclerosis, ischemia/reperfusion,
hypertension, restenosis, arterial inflammation, myocardial infarction, and stroke. In
addition, it relates to genetic polymorphisms useful for assessing the response to lipid
lowering drug therapy. Specifically, the present invention identifies and describes
gene variations which are individually present in humans with cardiovascular disease
states, relative to humans with normal, or non-cardiovascular disease states, and/or in
response to medications relevant to cardiovascular disease. Further, the present
invention provides methods for the identification and therapeutic use of compounds
as treatments of cardiovascular disease. Moreover, the present invention provides
methods for the diagnostic monitoring of patients undergoing clinical evaluation for
the treatment of cardiovascular disease, and for monitoring the efficacy of
compounds in clinical trials. Still further, the present invention provides methods to
use gene variations to predict personal medication schemes omitting adverse drug
reactions and allowing an adjustment of the drug dose to achieve maximum benefit
for the patient. Additionally, the present invention describes methods for the
diagnostic evaluation and prognosis of various cardiovascular diseases, and for the
identification of subjects exhibiting a predisposition to such conditions.

Background of the Invention 

Cardiovascular disease is a major health risk throughout the industrialized world.

Cardiovascular diseases include but are not limited to the following disorders of the
heart and the vascular system: congestive heart failure, myocardial infarction,
atherosclerosis, ischemic diseases of the heart, coronary heart disease, all kinds of
atrial and ventricular arrhythmias, hypertensive vascular diseases and peripheral
vascular diseases.

Heart failure is defined as a pathophysiologic state in which an abnormality of
cardiac function is responsible for the failure of the heart to pump blood at a rate
commensurate with the requirement of the metabolizing tissue. It includes all forms
of pumping failure such as high-output and low-output, acute and chronic,
right-sided or left-sided, systolic or diastolic, independent of the underlying cause.

Myocardial infarction (MI) is generally caused by an abrupt decrease in coronary
blood flow that follows a thrombotic occlusion of a coronary artery previously
narrowed by arteriosclerosis. MI prophylaxis (primary and secondary prevention) is
included as well as the acute treatment of MI and the prevention of complications.

Ischemic diseases are conditions in which the coronary flow is restricted, resulting in
a perfusion which is inadequate to meet the myocardial requirement for oxygen.
This group of diseases includes stable angina, unstable angina and asymptomatic
ischemia.

Arrhythmias include all forms of atrial and ventricular tachyarrhythmias (atrial
tachycardia, atrial flutter, atrial fibrillation, atrio-ventricular reentrant tachycardia,
preexcitation syndrome, ventricular tachycardia, ventricular flutter, ventricular
fibrillation) as well as bradycardic forms of arrhythmias.

Hypertensive vascular diseases include primary as well as all kinds of secondary 
arterial hypertension (renal, endocrine, neurogenic, others).

Peripheral vascular diseases are defined as vascular diseases in which arterial and/or
venous flow is reduced, resulting in an imbalance between blood supply and tissue
oxygen demand. They include chronic peripheral arterial occlusive disease (PAOD),
acute arterial thrombosis and embolism, inflammatory vascular disorders, Raynaud's
phenomenon and venous disorders.

Atherosclerosis, the most prevalent of vascular diseases, is the principal cause of 
heart attack, stroke, and gangrene of the extremities, and thereby the principal cause 
of death. Atherosclerosis is a complex disease involving many cell types and 
molecular factors (for a detailed review, see Ross, 1993, Nature 362: 801-809 and
Lusis, A. J., Nature 407, 233-241 (2000)). The process, in normal circumstances a
protective response to insults to the endothelium and smooth muscle cells (SMCs) of
the wall of the artery, consists of the formation of fibrofatty and fibrous lesions or
plaques, preceded and accompanied by inflammation. The advanced lesions of
atherosclerosis may occlude the artery concerned, and result from an excessive
inflammatory-fibroproliferative response to numerous different forms of insult. For
example, shear stresses are thought to be responsible for the frequent occurrence of
atherosclerotic plaques in regions of the circulatory system where turbulent blood
flow occurs, such as branch points and irregular structures.

The first observable event in the formation of an atherosclerotic plaque occurs when
blood-borne monocytes adhere to the vascular endothelial layer and transmigrate
through to the sub-endothelial space. Adjacent endothelial cells at the same time
produce oxidized low density lipoprotein (LDL). These oxidized LDLs are then
taken up in large amounts by the monocytes through scavenger receptors expressed
on their surfaces. In contrast to the regulated pathway by which native LDL (nLDL)
is taken up by nLDL specific receptors, the scavenger pathway of uptake is not
regulated by the monocytes.

These lipid-filled monocytes are called foam cells, and are the major constituent of
the fatty streak. Interactions between foam cells and the endothelial and SMCs which
surround them lead to a state of chronic local inflammation which can eventually
lead to smooth muscle cell proliferation and migration, and the formation of a fibrous
plaque. Such plaques occlude the blood vessel concerned and thus restrict the flow of
blood, resulting in ischemia.

Ischemia is a condition characterized by a lack of oxygen supply in tissues of organs
due to inadequate perfusion. Such inadequate perfusion can have a number of natural
causes, including atherosclerotic or restenotic lesions, anemia, or stroke, to name a
few. Many medical interventions, such as the interruption of the flow of blood during
bypass surgery, for example, also lead to ischemia. In addition to sometimes being
caused by diseased cardiovascular tissue, ischemia may sometimes affect 
cardiovascular tissue, such as in ischemic heart disease. Ischemia may occur in any 
organ, however, that is suffering a lack of oxygen supply. 

The most common cause of ischemia in the heart is atherosclerotic disease of 
epicardial coronary arteries. By reducing the lumen of these vessels, atherosclerosis 
causes an absolute decrease in myocardial perfusion in the basal state or limits 
appropriate increases in perfusion when the demand for flow is augmented. Coronary
blood flow can also be limited by arterial thrombi, spasm, and, rarely, coronary
emboli, as well as by ostial narrowing due to luetic aortitis. Congenital abnormalities,
such as anomalous origin of the left anterior descending coronary artery from the
pulmonary artery, may cause myocardial ischemia and infarction in infancy, but this
cause is very rare in adults. Myocardial ischemia can also occur if myocardial oxygen
demands are abnormally increased, as in severe ventricular hypertrophy due to
hypertension or aortic stenosis. The latter can be present with angina that is
indistinguishable from that caused by coronary atherosclerosis. A reduction in the
oxygen-carrying capacity of the blood, as in extremely severe anemia or in the
presence of carboxy-hemoglobin, is a rare cause of myocardial ischemia. Not
infrequently, two or more causes of ischemia will coexist, such as an increase in
oxygen demand due to left ventricular hypertrophy and a reduction in oxygen supply
secondary to coronary atherosclerosis.

The foregoing studies are aimed at defining the role of particular gene variations
presumed to be involved in the misleading of normal cellular function leading to
cardiovascular disease. However, such approaches cannot identify the full panoply of
gene variations that are involved in the disease process.

At present, the only available treatments for cardiovascular disorders are
pharmaceutical-based medications that are not targeted to an individual's actual defect;
examples include angiotensin converting enzyme (ACE) inhibitors and diuretics for
hypertension, insulin supplementation for non-insulin dependent diabetes mellitus
(NIDDM), cholesterol reduction strategies for dyslipidaemia, anticoagulants, β-
blockers for cardiovascular disorders and weight reduction strategies for obesity. If
targeted treatment strategies were available it might be possible to predict the
response to a particular regime of therapy and could markedly increase the
effectiveness of such treatment. Although targeted therapy requires accurate
diagnostic tests for disease susceptibility, once these tests are developed the
opportunity to utilize targeted therapy will become widespread. Such diagnostic tests
could initially serve to identify individuals at most risk of hypertension and could
allow them to make changes in lifestyle or diet that would serve as preventative
measures. The benefits associated by coupling the diagnostic tests with a system of
targeted therapy could include the reduction in dosage of administered drugs and thus
the amount of unpleasant side effects suffered by an individual. In more severe cases
a diagnostic test may suggest that earlier surgical intervention would be useful in
preventing a further deterioration in condition.

It is an object of the invention to provide genetic diagnosis of predisposition or
susceptibility for cardiovascular diseases. Another related object is to provide
treatment to reduce or prevent or delay the onset of disease in those predisposed or
susceptible to this disease. A further object is to provide m

Accordingly, a first aspect of the invention provides a method of diagnosis of disease
in an individual, said method comprising determining one, various or all genotypes
in said individual of the genes listed in the Examples.

In another aspect, the invention provides a method of identifying an individual
predisposed or susceptible to a disease, said method comprising determining one,
various or all genotypes in said individual of the genes listed in the Examples.

The invention is of advantage in that it enables diagnosis of a disease or of certain 
disease states via genetic analysis which can yield useable results before onset of 
disease symptoms, or before onset of severe symptoms. The invention is further of 
advantage in that it enables diagnosis of predisposition or susceptibility to a disease 
or of certain disease states via genetic analysis. 

The invention may also be of use in confirming or corroborating the results of other
diagnostic methods. The diagnosis of the invention may thus suitably be used either
as an isolated technique or in combination with other methods and apparatus for
diagnosis, in which latter case the invention provides a further test on which a
diagnosis may be assessed.

The present invention stems from using allelic association as a method for
genotyping individuals, allowing the investigation of the molecular genetic basis for
cardiovascular diseases. In a specific embodiment the invention tests for the
polymorphisms in the sequences of the genes listed in the Examples. The invention
demonstrates a link between these polymorphisms and predispositions to
cardiovascular diseases by showing that allele frequencies significantly differ when
individuals with "bad" serum lipids are compared to individuals with "good" serum
levels. The meaning of "good and bad" serum lipid levels is defined in Table 1a.
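
The allele-frequency comparison between the two serum lipid groups can be sketched as follows. The text does not state which statistical test was applied, so the Pearson chi-square test on a 2x2 table of allele counts is an assumption here, as are the function names and the genotypes, which are invented purely for illustration.

```python
import math


def allele_counts(genotypes, allele):
    """Count copies of `allele` and of all other alleles in diploid genotypes."""
    hits = sum(g.count(allele) for g in genotypes)
    total = 2 * len(genotypes)  # two alleles per individual
    return hits, total - hits


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 d.f., no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column of the table is empty")
    chi2 = n * (a * d - b * c) ** 2 / denom
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p


# Hypothetical genotypes at one SNP (not data from the study).
bad_lipids = ["AA", "AG", "AA", "AG", "AA", "AA", "GG", "AA"]
good_lipids = ["GG", "AG", "GG", "GG", "AG", "GG", "AA", "GG"]

a, b = allele_counts(bad_lipids, "A")   # A / non-A alleles in the "bad" group
c, d = allele_counts(good_lipids, "A")  # A / non-A alleles in the "good" group
chi2, p = chi_square_2x2(a, b, c, d)
print(f"A allele frequency: bad={a / (a + b):.2f}, good={c / (c + d):.2f}")
print(f"chi-square={chi2:.2f}, p={p:.4f}")
```

With real cohorts, one such test per polymorphic site would call for a correction for multiple testing, which this sketch leaves out.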

The PROCAM algorithm also defines a risk assessment based on lipids (LDL-
cholesterol, HDL-cholesterol, triglycerides) and risk factors like smoking, high blood
pressure or diabetes mellitus (Assmann, G., Schulte, H., von Eckardstein, A.: Am J
Cardiol 77 (1996): 1179-1184).

Certain disease states would benefit, that is to say the suffering of the patient may be
reduced or prevented or delayed, by administration of treatment or therapy in
advance of disease appearance; this can be more reliably carried out if predisposition
or susceptibility to disease can be diagnosed in advance.

Pharmacogenomics and adverse drug reactions

Adverse drug reactions (ADRs) remain a major clinical problem. A recent
meta-analysis suggested that in the USA in 1994, ADRs were responsible for 100 000
deaths, making them between the fourth and sixth commonest cause of death
(Lazarou 1998, J. Am. Med. Assoc. 279:1200). Although these figures have been
heavily criticized, they emphasize the importance of ADRs. Indeed, there is good
evidence that ADRs account for 5% of all hospital admissions and increase the length
of stay in hospital by two days at an increased cost of ~$2500 per patient. ADRs are
also one of the commonest causes of drug withdrawal, which has enormous financial
implications for the pharmaceutical industry. ADRs, perhaps fortunately, only affect
a minority of those taking a particular drug. Although factors that determine
susceptibility are unclear in most cases, there is increasing interest in the role of
genetic factors. Indeed, the role of inheritable variations in predisposing patients to
ADRs has been appreciated since the late 1950s and early 1960s through the
discovery of deficiencies in enzymes such as pseudocholinesterase
(butyrylcholinesterase) and glucose-6-phosphate dehydrogenase (G6PD). More recently, with
the first draft of the human genome just completed, there has been renewed interest
in this area with the introduction of terms such as pharmacogenomics and
toxicogenomics. Essentially, the aim of pharmacogenomics is to produce
personalized medicines, whereby administration of the drug class and dosage is
tailored to an individual genotype. Thus, the term pharmacogenomics embraces both
efficacy and toxicity.



WO 2004/018709

PCT/EP2003/009126



The 3-hydroxy-3-methylglutaryl coenzyme A (HMG-CoA) reductase inhibitors
("statins") specifically inhibit the enzyme HMG-CoA reductase which catalyzes the
rate limiting step in cholesterol biosynthesis. These drugs are effective in reducing
the primary and secondary risk of coronary artery disease and coronary events, such
as heart attack, in middle-aged and older men and women, in both diabetic and
non-diabetic patients, and are often prescribed for patients with hyperlipidemia. Statins
used in secondary prevention of coronary artery or heart disease significantly reduce
the risk of stroke, total mortality and morbidity and attacks of myocardial ischemia;
the use of statins is also associated with improvements in endothelial and fibrinolytic
functions and decreased platelet thrombus formation.

The tolerability of these drugs during long term administration is an important issue. 
Adverse reactions involving skeletal muscle are not uncommon, and sometimes
serious adverse reactions involving skeletal muscle such as myopathy and
rhabdomyolysis may occur, requiring discontinuation of the drug. In addition an
increase in serum creatine kinase (CK) may be a sign of a statin related adverse
event. The extent of such adverse events can be read from the extent of the CK level
increase (as compared to the upper limit of normal [ULN]).
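
Expressing a CK measurement relative to the upper limit of normal is simple arithmetic, sketched below. The function name and the numeric values are hypothetical; the upper limit of normal depends on the laboratory and assay, and no grading thresholds are implied.

```python
def ck_fold_over_uln(ck_u_per_l, uln_u_per_l):
    """Return serum CK as a multiple of the laboratory's upper limit of normal."""
    if uln_u_per_l <= 0:
        raise ValueError("upper limit of normal must be positive")
    return ck_u_per_l / uln_u_per_l


# Hypothetical values: measured CK of 950 U/L against a laboratory ULN of 190 U/L.
fold = ck_fold_over_uln(950, 190)
print(f"CK is {fold:.1f} x ULN")
```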

Occasionally arthralgia, alone or in association with myalgia, has been reported. Also
an elevation of liver transaminases has been associated with statin administration. 

It was shown that the drug response to statin therapy is a class effect, i.e. all known
and presumably also all so far undiscovered statins share the same beneficial and
harmful effects (Ucar, M. et al., Drug Safety 2000, 22:441). It follows that the
discovery of diagnostic tools to predict the drug response to a single statin will also
be of aid to guide therapy with other statins.

The present invention provides diagnostic tests to predict the patient's individual
response to statin therapy. Such responses include, but are not limited to, the extent
of adverse drug reactions, the level of lipid lowering or the drug's influence on
disease states. Those diagnostic tests may predict the response to statin therapy either
alone or in combination with another diagnostic test or another drug regimen.

Detailed Description of the Invention 

The present invention is based at least in part on the discovery that a specific allele of 
a polymorphic region of a so called "candidate gene" (as defined below) is associated
with CVD or drug response.

For the present invention the following candidate genes were analyzed: 

Genes found to be expressed in cardiac tissue (Hwang et al., Circulation 1997, 
96:4146-4203). 

Genes from the following metabolic pathways and their regulatory elements:

Lipid metabolism

Numerous studies have shown a connection between serum lipid levels and 
cardiovascular diseases. Candidate genes falling into this group include but are not
limited to genes of the cholesterol pathway, apolipoproteins and their modifying
factors.

Coagulation 

Ischemic diseases of the heart and in particular myocardial infarction may be caused 
by a thrombotic occlusion. Genes falling into this group include all genes of the
coagulation cascade and their regulatory elements.

Inflammation

Complications of atherosclerosis are the most common causes of death in Western
societies. In broad outline atherosclerosis can be considered to be a form of chronic
inflammation resulting from interaction between modified lipoproteins, monocyte-derived
macrophages, T cells, and the normal cellular elements of the arterial wall. This
inflammatory process can ultimately lead to the development of complex lesions, or
plaques, that protrude into the arterial lumen. Finally plaque rupture and thrombosis
result in the acute clinical complications of myocardial infarction and stroke (Glass et
al., Cell 2001, 104:503-516).

It follows that all genes related to inflammatory processes, including but not limited
to cytokines, cytokine receptors and cell adhesion molecules, are candidate genes for
CVD.

Glucose and energy metabolism

As glucose and energy metabolism is interdependent with the metabolism of lipids
(see above), the former pathways also contain candidate genes. Energy metabolism in
general also relates to obesity, which is an independent risk factor for CVD
(Melanson et al., Cardiol Rev 2001, 9:202-207). In addition high blood glucose levels
are associated with many microvascular and macrovascular complications and may
therefore affect an individual's disposition to CVD (Duckworth, Curr Atheroscler Rep
2001, 3:383-391).

Hypertension 

As hypertension is an independent risk factor for CVD, genes that are involved
in the regulation of systolic and diastolic blood pressure also affect an individual's risk for
CVD (Safer, Curr Opin Cardiol 2000, 15:258-263). Interestingly hypertension and
diabetes (see above) appear to be interdependent, since hypertension is
approximately twice as frequent in patients with diabetes compared with patients
without the disease. Conversely, recent data suggest that hypertensive persons are
more predisposed to the development of diabetes than are normotensive persons
(Sowers et al., Hypertension 2001, 37:1053-1059).

Genes related to drug response 

Those genes include metabolic pathways involved in the absorption, distribution, 
metabolism, excretion and toxicity (ADMET) of drugs. Prominent members of this 
group are the cytochrome P450 proteins which catalyze many reactions involved in 
drug metabolism. 

Unclassified genes 

As stated above, the mechanisms that lead to cardiovascular diseases or define the 
patient's individual response to drugs are not completely elucidated. Hence candidate
genes that could not be assigned to the above listed categories were also analysed.
The present invention is based at least in part on the discovery of
polymorphisms that lie in genomic regions of unknown physiological function.

Results 

After conducting an association study, we surprisingly found polymorphic sites in a 
number of candidate genes which show a strong correlation with the following 
phenotypes of the patients analysed: "Healthy" as used herein refers to individuals
that neither suffer from existing CVD, nor exhibit an increased risk for CVD through
their serum lipid level profile. "CVD prone" as used herein refers to individuals with
existing CVD and/or a serum lipid profile that confers a high risk to get CVD (see
Table 1a for definitions of healthy and CVD prone serum lipid levels). "High
responder" as used herein refers to patients who benefit from relatively small
amounts of a given drug. "Low responder" as used herein refers to patients who need
relatively high doses in order to obtain benefit from the medication. "Tolerant
patient" refers to individuals who can tolerate high doses of a medicament without
exhibiting adverse drug reactions. "ADR patient" as used herein refers to individuals
who suffer from ADR or show clinical symptoms (like creatine kinase elevation in
blood) even after receiving only minor doses of a medicament (see Table 1b for a
detailed definition of drug response phenotypes).

Polymorphic sites in candidate genes that were found to be significantly associated 
with either of the above mentioned phenotypes will be referred to as "phenotype
associated SNPs" (PA SNPs). The respective genomic loci that harbour PA SNPs 
will be referred to as "phenotype associated genes" (PA genes), irrespective of the 
actual function of this gene locus. 

In particular we surprisingly found PA SNPs associated with CVD, drug efficacy 
(EFF) or adverse drug reactions (ADR) in the following genes.

ABCB11: ATP-binding cassette, sub-family B (MDR/TAP), member 11

The membrane-associated protein encoded by this gene is a member of the
superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport
various molecules across extra- and intra-cellular membranes. ABC genes are
divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP,
GCN20, White). This protein is a member of the MDR/TAP subfamily. Members of
the MDR/TAP subfamily are involved in multidrug resistance. The protein encoded
by this gene is the major canalicular bile salt export pump in man. Mutations in this
gene cause a form of progressive familial intrahepatic cholestases which are a group
of inherited disorders with severe cholestatic liver disease from early infancy.

ABCB4: ATP-binding cassette, sub-family B (MDR/TAP), member 4

The membrane-associated protein encoded by this gene is a member of the 
superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport
various molecules across extra- and intra-cellular membranes. ABC genes are
divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP,
GCN20, White). This protein is a member of the MDR/TAP subfamily. Members of
the MDR/TAP subfamily are involved in multidrug resistance as well as antigen
presentation. This gene encodes a full transporter and member of the p-glycoprotein
family of membrane proteins with phosphatidylcholine as its substrate. The function
of this protein has not yet been determined; however, it may involve transport of
phospholipids from liver hepatocytes into bile. Alternative splicing of this gene
results in several products of undetermined function.

ABCC1: ATP-binding cassette, sub-family C (CFTR/MRP), member 1

The protein encoded by this gene is a member of the superfamily of ATP-binding
cassette (ABC) transporters. ABC proteins transport various molecules across extra-
and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies
(ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This full transporter is a
member of the MRP subfamily which is involved in multi-drug resistance. This
protein functions as a multispecific organic anion transporter, with oxidized
glutathione, cysteinyl leukotrienes, and activated aflatoxin B1 as substrates. This
protein also transports glucuronides and sulfate conjugates of steroid hormones and
bile salts. Alternative splicing by exon deletion results in several splice variants but
maintains the original open reading frame in all forms.

ACTB mRNA for mutant beta-actin 

Beta actin is one of six different actin isoforms which have been identified. ACTB is
one of the two nonmuscle cytoskeletal actins. Actins are highly conserved proteins
that are involved in cell motility, structure and integrity. Alpha actins are a major
constituent of the contractile apparatus.

ACTIN, ALPHA SKELETAL MUSCLE (ALPHA-ACTIN 1)

Actin alpha 1 which is expressed in skeletal muscle is one of six different actin
isoforms which have been identified. Actins are highly conserved proteins that are
involved in cell motility, structure and integrity. Alpha actins are a major constituent
of the contractile apparatus.

ADCYAP1: adenylate cyclase activating polypeptide 1 (pituitary)

This gene encodes adenylate cyclase activating polypeptide 1. Mediated by adenylate
cyclase activating polypeptide 1 receptors, this polypeptide stimulates adenylate
cyclase and subsequently increases the cAMP level in target cells. Adenylate cyclase
activating polypeptide 1 is not only a hypophysiotropic hormone, but also functions
as a neurotransmitter and neuromodulator. In addition, it plays a role in paracrine and
autocrine regulation of certain types of cells. This gene is composed of five exons.
Exons 1 and 2 encode the 5' UTR and signal peptide, respectively; exon 4 encodes an
adenylate cyclase activating polypeptide 1-related peptide; and exon 5 encodes the
mature peptide and 3' UTR. This gene encodes three different mature peptides,
including two isotypes: a shorter form and a longer form.

ADRB3: adrenergic, beta-3-, receptor

The ADRB3 gene product, beta-3-adrenergic receptor, is located mainly in adipose
tissue and is involved in the regulation of lipolysis and thermogenesis. Beta
adrenergic receptors are involved in the epinephrine- and norepinephrine-induced
activation of adenylate cyclase through the action of G proteins.

AGL: amylo-1,6-glucosidase, 4-alpha-glucanotransferase (glycogen debranching
enzyme, glycogen storage disease type III)

Glycogen debranching enzyme is involved in glycogen degradation and has two
independent catalytic activities: a 4-alpha-glucotransferase activity (EC 2.4.1.25) and
an amylo-1,6-glucosidase activity (EC 3.2.1.33). Both activities occur at different sites
on the single polypeptide chain. Mutations in this gene cause glycogen storage
disease. A wide range of clinical and enzymatic variability occurs in glycogen
debrancher deficiency, some of which may be due to tissue-specific alternative
splicing. Six splice variants that differ in the 5' end have been identified in liver and
muscle tissue. Variants 1, 5, and 6 are present in both liver and muscle, whereas
variants 2, 3, and 4 occur in muscle. Variants 1 through 4 encode identical proteins
(isoform 1) that include 27 N-terminal amino acids not found in splice variants 5 and
6. Variants 5 and 6 encode different amino-terminal ends of 10 and 11 amino acids in
protein isoforms 2 and 3, respectively, with the remainder of the peptide identical to
that of isoform 1.

AKAPl: A kinase (PRKA) anchor protein 1 

Anchors cAMP-dependent protein kinase near its physiological substrates; interacts
with both the type I and type II regulatory subunits.

Angiotensinogen gene 

The protein encoded by this gene, pre-angiotensinogen or angiotensinogen precursor,
is expressed in the liver and is cleaved by the enzyme renin in response to lowered
blood pressure. The resulting product, angiotensin I, is then cleaved by angiotensin
converting enzyme (ACE) to generate the physiologically active enzyme angiotensin
II. The protein is involved in maintaining blood pressure and in the pathogenesis of
essential hypertension and preeclampsia.

ANXA6: annexin A6

Annexin VI belongs to a family of calcium-dependent membrane and phospholipid
binding proteins. Although their functions are still not clearly defined, several
members of the annexin family have been implicated in membrane-related events
along exocytotic and endocytotic pathways. The annexin VI gene is approximately
60 kbp long and contains 26 exons. It encodes a protein of about 68 kDa that consists
of eight 68-amino acid repeats separated by linking sequences of variable lengths. It
is highly similar to human annexins I and II sequences, each of which contain four
such repeats. Exon 21 of annexin VI is alternatively spliced, giving rise to two
isoforms that differ by a 6-amino acid insertion at the start of the seventh repeat.
Annexin VI has been implicated in mediating the endosome aggregation and vesicle
fusion in secreting epithelia during exocytosis.

AP2B1: adaptor-related protein complex 2, beta 1 subunit

The beta adaptin subunit is part of the clathrin coat assembly complex which links
clathrin to receptors in coated pits and vesicles. These vesicles are involved in
endocytosis and Golgi processing. The beta 1 subunit is one of the
which binds to clathrin and initiates coat formation.

APOA1: apolipoprotein A-I

APOA1 promotes cholesterol efflux from tissues to the liver for excretion.
Apolipoprotein A-I is the major protein component of high density lipoprotein
(HDL) in the plasma. Synthesized in the liver and small intestine, it consists of two
identical chains of 77 amino acids; an 18-amino acid signal peptide is removed
co-translationally and a 6-amino acid propeptide is cleaved post-translationally.
Variation in the latter step, in addition to modifications leading to so-called isoforms,
is responsible for some of the polymorphism observed. APOA1 is a cofactor for
lecithin cholesterolacyltransferase (LCAT) which is responsible for the formation of
most plasma cholesteryl esters. The APOA1, APOC3 and APOA4 genes are closely
linked in both rat and human genomes. The A-I and A-IV genes are transcribed from
the same strand, while the C-III gene is transcribed convergently in relation to A-I.
Defects in the apolipoprotein A-I gene are associated with HDL deficiency and
Tangier disease.

APOA4: apolipoprotein A-IV 

Apolipoprotein (apo) A-IV gene contains 3 exons separated by two introns. A sequence
polymorphism has been identified in the 3' UTR of the third exon. The primary
translation product is a 396-residue preprotein which after proteolytic processing is
secreted from its primary site of synthesis, the intestine, in association with chylomicron
particles. Although its precise function is not known, apo A-IV is a potent activator
of lecithin-cholesterol acyltransferase in vitro.

APOB: apolipoprotein B 

Apolipoprotein B (ApoB) is the main apolipoprotein of chylomicrons and low
density lipoproteins (LDL). The protein occurs in the plasma in 2 main isoforms,
apoB-48 and apoB-100. The first is synthesized exclusively by the gut, the second by
the liver. The intestinal (B-48) and hepatic (B-100) forms of apoB are coded by a
single gene and by a single mRNA transcript larger than 16 kb. The 2 proteins share
a common amino terminal sequence. In the ApoB-100 isoform the precursor has
4,563 amino acids, and the mature apoB-100 has 4,536 amino acid residues. Mature,
circulating B-48 is homologous over its entire length (estimated to be between 2,130
and 2,144 amino acid residues) with the amino-terminal portion of B-100 and
contains no sequence from the carboxyl end of B-100. From structural studies, it is
thought that apoB-48 represents the amino-terminal 47% of apoB-100 and that the
carboxyl terminus of apoB-48 is in the vicinity of residue 2151 of apoB-100.
Apolipoprotein B-48 may be the product of an intestinal mRNA with an in-frame
UAA stop codon resulting from a C-to-U change in the codon CAA encoding
Gln(2153) in apoB-100 mRNA. Since only the sequence that codes B-100 is present
in genomic DNA, this presents the possibility of an organ-specific introduction of a
stop codon to an mRNA and the change from CAA to UAA of codon 2153 of the
message as a unique RNA editing process.
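
The C-to-U editing step described for apoB can be illustrated with a short sketch.
The Python fragment below is a minimal illustration only: the nine-nucleotide
window is a hypothetical sequence, not the real apoB-100 message, and the function
names edit_c_to_u and first_stop are likewise assumptions introduced for this
example. It shows how a single C-to-U change converts the glutamine codon CAA
into the in-frame stop codon UAA, and it checks that residue 2151 of the
4,536-residue mature apoB-100 lies at roughly the amino-terminal 47%.

```python
# Illustrative sketch; the sequence below is hypothetical, not apoB mRNA.
STOP_CODONS = {"UAA", "UAG", "UGA"}


def edit_c_to_u(mrna: str, codon_index: int) -> str:
    """Change the first base of the given 0-based codon from C to U."""
    pos = codon_index * 3
    if mrna[pos] != "C":
        raise ValueError("expected a C at the edited position")
    return mrna[:pos] + "U" + mrna[pos + 1:]


def first_stop(mrna: str):
    """Return the 0-based index of the first in-frame stop codon, or None."""
    for i in range(0, len(mrna) - 2, 3):
        if mrna[i:i + 3] in STOP_CODONS:
            return i // 3
    return None


# Hypothetical window: codons AUA, CAA (Gln), UUU.
window = "AUACAAUUU"
edited = edit_c_to_u(window, 1)

# Figures taken from the text: mature apoB-100 length and the approximate
# position of the apoB-48 carboxyl terminus.
MATURE_APOB100_RESIDUES = 4536
APOB48_C_TERMINUS = 2151
fraction = APOB48_C_TERMINUS / MATURE_APOB100_RESIDUES

print(edited)
print(first_stop(window), first_stop(edited))
print(round(100 * fraction))
```

In this sketch the unedited window contains no in-frame stop codon, whereas the
edited window terminates at the position of the former CAA codon, which mirrors
the organ-specific truncation proposed for apoB-48.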

APOD: apolipoprotein D 

Apolipoprotein D (Apo-D) is a component of high density lipoprotein that has no
marked similarity to other apolipoprotein sequences. It has a high degree of
homology to plasma retinol-binding protein and other members of the alpha 2
microglobulin protein superfamily of carrier proteins, also known as lipocalins. It is a
glycoprotein of estimated molecular weight 33 kDa. Apo-D is closely associated
with the enzyme lecithin:cholesterol acyltransferase, an enzyme involved in
lipoprotein metabolism.

APXL: apical protein-like (Xenopus laevis) 

The protein encoded by this gene shares significant similarities with the apical
protein from Xenopus laevis which is implicated in amiloride-sensitive sodium
channel activity. This gene is a strong candidate gene for ocular albinism type 1
syndrome.

ARF4: ADP-ribosylation factor 4 

ADP-ribosylation factor 4 (ARF4) is a member of the human ARF gene family.
These genes encode small guanine nucleotide-binding proteins that stimulate the
ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular
trafficking and as activators of phospholipase D. The gene products include 6 ARF
proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily.
The ARF proteins are categorized as class I (ARF1, ARF2, and ARF3), class II
(ARF4 and ARF5) and class III (ARF6). The members of each class share a common
gene organization. The ARF4 gene spans approximately 12 kb and contains six exons
and five introns. ARF4 is the most divergent member of the human ARFs.
Conflicting positions at 3p14 or 3p21 have been reported for this gene.

ATP1A2: ATPase, Na+/K+ transporting, alpha 2 (+) polypeptide

Alpha 2 subunit of the sodium- and potassium-transporting ATPase; required for
Na+ and K+ gradient maintenance across plasma membrane.

ATP1B1: ATPase, Na+/K+ transporting, beta 1 polypeptide

Beta 1 subunit of Na+/K+-ATPase.

ATP1B3: ATPase, Na+/K+ transporting, beta 3 polypeptide 

Beta 3 subunit of the Na+/K+-ATPase.

ATP2A2: ATPase, Ca++ transporting, cardiac muscle, slow twitch 2

Slow twitch cardiac muscle Ca2+-ATPase; pumps calcium, may have a role in
calcium signaling pathways. 

ATP5G1: ATP synthase, H+ transporting, mitochondrial F0 complex, subunit c
(subunit 9), isoform 1

Isoform 1 (P1) of subunit c, H+-translocating subunit of F0 ATP synthase; catalyzes
the synthesis of ATP during oxidative phosphorylation.

ATP6V1E: ATPase, H+ transporting, lysosomal 31kD, V1 subunit E

This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit
enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase
dependent organelle acidification is necessary for such intracellular processes as
protein sorting, zymogen activation, and receptor-mediated endocytosis. V-ATPase is
comprised of a cytosolic V1 domain and a transmembrane V0 domain. The V1
domain consists of a hexamer of three A and three B subunits plus the C, D, and E
subunits. It contains the ATP catalytic site. The encoded protein is known as the E
subunit and is found ubiquitously. Pseudogenes for this gene have been found in the
genome.

ATPase, Ca++ transporting, cardiac muscle, fast twitch 1

Fast-twitch skeletal muscle sarcoplasmic reticulum Ca2+-ATPase; pumps calcium.

AXIN1: axin

Strongly similar to murine Axin; may regulate embryonic axis formation. 

BMPR1A: bone morphogenetic protein receptor, type IA

The bone morphogenetic protein (BMP) receptors are a family of transmembrane
serine/threonine kinases that include the type I receptors BMPR1A and BMPR1B
and the type II receptor BMPR2. These receptors are also closely related to the
activin receptors, ACVR1 and ACVR2. The ligands of these receptors are members
of the TGF-beta superfamily. TGF-betas and activins transduce their signals through
the formation of heteromeric complexes with 2 different types of serine (threonine)
kinase receptors: type I receptors of about 50-55 kD and type II receptors of about
70-80 kD. Type II receptors bind ligands in the absence of type I receptors, but they
require their respective type I receptors for signaling, whereas type I receptors require
their respective type II receptors for ligand binding.

BRD3: bromodomain containing 3

This gene was identified based on its homology to the gene encoding the RING3
protein, a serine/threonine kinase. The gene localizes to 9q34, a region which
contains several major histocompatibility complex (MHC) genes. The function of the
encoded protein is not known.

CACNA1C: calcium channel, voltage-dependent, L type, alpha 1C subunit

Alpha 1C subunit of the voltage-dependent calcium channel; channel is of the L type
and is expressed in the heart.

CALB2: calbindin 2, (29kD, calretinin) 

Calbindin 2 (calretinin), closely related to calbindin 1, is an intracellular
calcium-binding protein belonging to the troponin C superfamily. Calbindin 1 is known
to be involved in the vitamin-D-dependent calcium absorption through intestinal and renal
epithelia, while the function of neuronal calbindin 1 and calbindin 2 is poorly
understood. The sequence of the calbindin 2 cDNA reveals an open reading frame of
271 codons coding for a protein of 31,520 Da, and shares 58% identical residues with
human calbindin 1. Calbindin 2 contains five presumably active and one presumably
inactive calcium-binding domains. Comparison with the partial sequences available
for chick and guinea pig calbindin 2 reveals that the protein is highly conserved in
evolution. The calbindin 2 message was detected in the brain, while absent from
heart muscle, kidney, liver, lung, spleen, stomach and thyroid gland. There are two
additional forms of alternatively spliced calbindin 2 mRNAs encoding C-terminally
truncated proteins. Exon 7 can splice to exon 9, resulting in a frame shift and a
translational stop at the second codon of exon 9, and encoding calretinin-20k. Exon 7
can also splice to exon 10, resulting in a frame shift and a translational stop at codon
15 of exon 10, and encoding calretinin-22k. The truncated proteins are able to bind
calcium.

CALCIUM-TRANSPORTING ATPASE PLASMA MEMBRANE, ISOFORMS
3A/3B (EC 3.6.1.38) (CALCIUM PUMP) (PMCA3)

Plasma membrane Ca2+-ATPase 3; pumps calcium.

CALM3: calmodulin 3 (phosphorylase kinase, delta)

Calmodulin 3; binds calcium. 

CAV1: caveolin 1, caveolae protein, 22kD

The scaffolding protein encoded by this gene is the main component of the caveolae
plasma membranes found in most cell types. The protein links integrin subunits to
the tyrosine kinase FYN, an initiating step in coupling integrins to the Ras-ERK
pathway and promoting cell cycle progression. The gene is a tumor suppressor gene
candidate and a negative regulator of the Ras-p42/44 MAP kinase cascade. CAV1
and CAV2 are located next to each other on chromosome 7 and express colocalizing
proteins that form a stable hetero-oligomeric complex. By using alternative initiation
codons in the same reading frame, two isoforms (alpha and beta) are encoded by a
single transcript from this gene.

CAV3: caveolin 3

This gene encodes a caveolin family member, which functions as a component of the
caveolae plasma membranes found in most cell types. Caveolin proteins are proposed
to be scaffolding proteins for organizing and concentrating certain
caveolin-interacting molecules. Mutations identified in this gene lead to interference with
protein oligomerization or intra-cellular routing, disrupting caveolae formation and
resulting in Limb-Girdle muscular dystrophy type-1C (LGMD-1C), hyperCKemia or
rippling muscle disease (RMD). Alternative splicing has been identified for this
locus, with inclusion or exclusion of a differentially spliced intron. In addition,
transcripts utilize multiple polyA sites and contain two potential translation initiation
sites.

CCR2: chemokine (C-C motif) receptor 2

This gene encodes two isoforms of a receptor for monocyte chemoattractant
protein-1, a chemokine which specifically mediates monocyte chemotaxis. Monocyte
chemoattractant protein-1 is involved in monocyte infiltration in inflammatory
diseases such as rheumatoid arthritis as well as in the inflammatory response against
tumors. The receptors encoded by this gene mediate agonist-dependent calcium
mobilization and inhibition of adenylyl cyclase. This gene is located in the
chemokine receptor gene cluster region. Two alternatively spliced transcript variants
are expressed by the gene.

CDH1: cadherin 1, type 1, E-cadherin (epithelial)

This gene is a classical cadherin from the cadherin superfamily. The encoded protein
is a calcium dependent cell-cell adhesion glycoprotein comprised of five extracellular
cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail.
Mutations in this gene are correlated with gastric, breast, colorectal, thyroid and
ovarian cancer. Loss of function is thought to contribute to progression in cancer by
increasing proliferation, invasion, and/or metastasis. The ectodomain of this protein
mediates bacterial adhesion to mammalian cells and the cytoplasmic domain is
required for internalization. Identified transcript variants arise from mutation at
consensus splice sites.

CDH11: cadherin 11, type 2, OB-cadherin (osteoblast)

This gene encodes a type II classical cadherin from the cadherin superfamily, integral
membrane proteins that mediate calcium-dependent cell-cell adhesion. Mature
cadherin proteins are composed of a large N-terminal extracellular domain, a single
membrane-spanning domain, and a small, highly conserved C-terminal cytoplasmic
domain. Type II (atypical) cadherins are defined based on their lack of a HAV cell
adhesion recognition sequence specific to type I cadherins. Expression of this
particular cadherin in osteoblastic cell lines, and its upregulation during
differentiation, suggests a specific function in bone development and maintenance.
Two splice variants have been identified, one of which encodes an isoform with a
truncated cytoplasmic domain.

CDH13: cadherin 13, H-cadherin (heart) 

This gene is a member of the cadherin superfamily. The encoded protein is a calcium
dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin
repeats, a transmembrane region but, unlike the typical cadherin superfamily
member, lacks the highly conserved cytoplasmic region. This particular cadherin is a
putative mediator of cell-cell interaction in the heart and may act as a negative
regulator of neural cell growth. The gene locus is hypermethylated or deleted in
breast, ovarian and lung cancers. Two major mRNA transcripts encoding identical
proteins are found, products of alternative polyadenylation sites.

CENPC1: centromere protein C 1

Centromere protein C 1 is a centromere autoantigen and a component of the inner
kinetochore plate. The protein is required for maintaining proper kinetochore size
and a timely transition to anaphase. A putative pseudogene exists on chromosome 12.

Cholesteryl ester transfer protein (CETP)

Cholesteryl ester transfer protein (CETP) transfers cholesteryl esters between
lipoproteins. CETP may affect susceptibility to atherosclerosis.

CLCN4: chloride channel 4 

The CLCN family of voltage-dependent chloride channel genes comprises nine
members (CLCN1-7, Ka and Kb) which demonstrate quite diverse functional
characteristics while sharing significant sequence homology. Chloride channel 4 has
an evolutionary conserved CpG island and is conserved in both mouse and hamster.
This gene is mapped in close proximity to APXL (Apical protein Xenopus
laevis-like) and OA1 (Ocular albinism type 1), which are both located on the human X
chromosome at band p22.3. The physiological role of chloride channel 4 remains
unknown but may contribute to the pathogenesis of neuronal disorders.

CLCNKA: chloride channel Ka 

Putative chloride channel; member of the CLC family of voltage-gated chloride
channels.

COL6A3: collagen, type VI, alpha 3

This gene encodes the alpha 3 chain, one of the three alpha chains of type VI
collagen, a beaded filament collagen found in most connective tissues. The alpha 3
chain of type VI collagen is much larger than the alpha 1 and 2 chains. This
difference in size is largely due to an increase in the number of subdomains, similar
to von Willebrand Factor type A domains, found in the amino terminal globular
domain of all the alpha chains. These domains have been shown to bind extracellular
matrix proteins, an interaction that explains the importance of this collagen in
organizing matrix components. Mutations in the type VI collagen genes are
associated with Bethlem myopathy. In addition to the full length transcript, four
transcript variants have been identified that encode proteins with N-terminal globular
domains of varying sizes.

COL7A1: collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic,
dominant and recessive)

This gene encodes the alpha chain of type VII collagen. The type VII collagen fibril,
composed of three identical alpha collagen chains, is restricted to the basement zone
beneath stratified squamous epithelia. It functions as an anchoring fibril between the
external epithelia and the underlying stroma. Mutations in this gene are associated
with all forms of dystrophic epidermolysis bullosa. In the absence of mutations,
however, an acquired form of this disease can result from an autoimmune response
made to type VII collagen.

COL9A3: collagen, type IX, alpha 3

This gene encodes one of the three alpha chains of type IX collagen, the major
collagen component of hyaline cartilage. Type IX collagen, a heterotrimeric
molecule, is usually found in tissues containing type II collagen, a fibrillar collagen.
Mutations in this gene are associated with multiple epiphyseal dysplasia.

COMT: catechol-O-methyltransferase

Catechol-O-methyltransferase catalyzes the transfer of a methyl group from
S-adenosylmethionine to catecholamines, including the neurotransmitters dopamine,
epinephrine, and norepinephrine. This O-methylation results in one of the major
degradative pathways of the catecholamine transmitters. In addition to its role in the
metabolism of endogenous substances, COMT is important in the metabolism of
catechol drugs used in the treatment of hypertension, asthma, and Parkinson disease.
COMT is found in two forms in tissues, a soluble form (S-COMT) and a
membrane-bound form (MB-COMT). The differences between S-COMT and MB-COMT reside
within the N-termini. The transcript variants are formed through the use of alternative
translation initiation sites and promoters.

COX10: COX10 homolog, cytochrome c oxidase assembly protein, heme A:
farnesyltransferase (yeast)

Cytochrome c oxidase (COX), the terminal component of the mitochondrial
respiratory chain, catalyzes the electron transfer from reduced cytochrome c to
oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits
encoded by mitochondrial genes and multiple structural subunits encoded by nuclear
genes. The mitochondrially-encoded subunits function in electron transfer, and the
nuclear-encoded subunits may function in the regulation and assembly of the
complex. This nuclear gene encodes heme A:farnesyltransferase, which is not a
structural subunit but required for the expression of functional COX and functions in
the maturation of the heme A prosthetic group of COX. This protein is predicted to
contain 7-9 transmembrane domains localized in the mitochondrial inner membrane.
A gene mutation, which results in the substitution of a lysine for an asparagine
(N204K), is identified to be responsible for cytochrome c oxidase deficiency. In
addition, this gene is disrupted in patients with CMT1A (Charcot-Marie-Tooth type
1A) duplication and with HNPP (hereditary neuropathy with liability to pressure
palsies) deletion.

CPB2: carboxypeptidase B2 (plasma, carboxypeptidase U)

Carboxypeptidases are enzymes that hydrolyze C-terminal peptide bonds. The
carboxypeptidase family includes metallo-, serine, and cysteine carboxypeptidases.
According to their substrate specificity, these enzymes are referred to as
carboxypeptidase A (cleaving aliphatic residues) or carboxypeptidase B (cleaving
basic amino residues). The protein encoded by this gene is activated by trypsin and
acts on carboxypeptidase B substrates. After thrombin activation, the mature protein
downregulates fibrinolysis. Polymorphisms have been described for this gene and its
promoter region. Available sequence data analyses indicate splice variants that
encode different isoforms.

CPO: coproporphyrinogen oxidase (coproporphyria, harderoporphyria) 

Coproporphyrinogen oxidase; catalyzes oxidative decarboxylation in the sixth step of
heme biosynthesis.

CRYAB: crystallin, alpha B

Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous.
The latter class constitutes the major proteins of vertebrate eye lens and maintains the
transparency and refractive index of the lens. Since lens central fiber cells lose their
nuclei during development, these crystallins are made and then retained throughout
life, making them extremely stable proteins. Mammalian lens crystallins are divided
into alpha, beta, and gamma families; beta and gamma crystallins are also considered
as a superfamily. Alpha and beta families are further divided into acidic and basic
groups. Seven protein regions exist in crystallins: four homologous motifs, a
connecting peptide, and N- and C-terminal extensions. Alpha crystallins are
composed of two gene products: alpha-A and alpha-B, for acidic and basic,
respectively. Alpha crystallins can be induced by heat shock and are members of the
small heat shock protein (sHSP also known as the HSP20) family. They act as
molecular chaperones although they do not renature proteins and release them in the
fashion of a true chaperone; instead they hold them in large soluble aggregates.
Post-translational modifications decrease the ability to chaperone. These heterogeneous
aggregates consist of 30-40 subunits; the alpha-A and alpha-B subunits have a 3:1
ratio, respectively. Two additional functions of alpha crystallins are an autokinase
activity and participation in the intracellular architecture. Alpha-A and alpha-B gene
products are differentially expressed; alpha-A is preferentially restricted to the lens
and alpha-B is expressed widely in many tissues and organs. Elevated expression of
alpha-B crystallin occurs in many neurological diseases; a missense mutation
cosegregated in a family with a desmin-related myopathy.

CSF2RB: colony stimulating factor 2 receptor, beta, low-affinity
(granulocyte-macrophage)

CSF2RB is a common beta chain of the high affinity receptor for IL-3, IL-5 and
CSF. Defective CSF2RB has been reported to be associated with pulmonary alveolar
proteinosis.

CUBN: cubilin (intrinsic factor-cobalamin receptor)

Cubilin (CUBN) acts as a receptor for intrinsic factor-vitamin B12 complexes. The
role of receptor is supported by the presence of 27 CUB domains. Cubilin is located
within the epithelium of intestine and kidney. Mutations in CUBN may play a role in
autosomal recessive megaloblastic anemia.

CXorf6: chromosome X open reading frame 6 

CYP17: cytochrome P450, subfamily XVII (steroid 17-alpha-hydroxylase), 
adrenal hyperplasia 

This gene encodes a member of the cytochrome P450 superfamily of enzymes. The
cytochrome P450 proteins are monooxygenases which catalyze many reactions
involved in drug metabolism and synthesis of cholesterol, steroids and other lipids.
This protein localizes to the endoplasmic reticulum. It has both 17alpha-hydroxylase
and 17,20-lyase activities and is a key enzyme in the steroidogenic pathway that
produces progestins, mineralocorticoids, glucocorticoids, androgens, and estrogens.
Mutations in this gene are associated with isolated steroid-17 alpha-hydroxylase
deficiency, 17-alpha-hydroxylase/17,20-lyase deficiency, pseudohermaphroditism,
and adrenal hyperplasia.

CYP2C8: cytochrome P450, subfamily IIC (mephenytoin 4-hydroxylase),
polypeptide 8

This gene encodes a member of the cytochrome P450 superfamily of enzymes. The
cytochrome P450 proteins are monooxygenases which catalyze many reactions
involved in drug metabolism and synthesis of cholesterol, steroids and other lipids.
This protein localizes to the endoplasmic reticulum and its expression is induced by
phenobarbital. The enzyme is known to metabolize many xenobiotics, including the
anticonvulsive drug mephenytoin, benzo(a)pyrene, 7-ethoxycoumarin, and the
anti-cancer drug taxol. Two transcript variants for this gene have been described; it is
thought that the longer form does not encode an active cytochrome P450 since its
protein product lacks the heme binding site. This gene is located within a cluster of
cytochrome P450 genes on chromosome 10q24.

CYP2E: cytochrome P450, subfamily IIE (ethanol-inducible)

This gene encodes a member of the cytochrome P450 superfamily of enzymes. The
cytochrome P450 proteins are monooxygenases which catalyze many reactions
involved in drug metabolism and synthesis of cholesterol, steroids and other lipids.
This protein localizes to the endoplasmic reticulum and is induced by ethanol, the
diabetic state, and starvation. The enzyme metabolizes both endogenous substrates,
such as ethanol, acetone, and acetal, as well as exogenous substrates including
benzene, carbon tetrachloride, ethylene glycol, and nitrosamines which are
premutagens found in cigarette smoke. Due to its many substrates, this enzyme may
be involved in such varied processes as gluconeogenesis, hepatic cirrhosis, diabetes,
and cancer.

CYP3A4 

This gene, CYP3A4, encodes a member of the cytochrome P450 superfamily of
enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many
reactions involved in drug metabolism and synthesis of cholesterol, steroids and
other lipids. This protein localizes to the endoplasmic reticulum and its expression is
induced by glucocorticoids and some pharmacological agents. This enzyme is
involved in the metabolism of approximately half the drugs which are used today,
including acetaminophen, codeine, cyclosporin A, diazepam and erythromycin. The
enzyme also metabolizes some steroids and carcinogens. This gene is part of a cluster
of cytochrome P450 genes on chromosome 7q21.1. Previously another CYP3A gene,
CYP3A3, was thought to exist; however, it is now thought that this sequence
represents a transcript variant of CYP3A4.

CYP4F8: cytochrome P450, subfamily IVF, polypeptide 8

This gene, CYP4F8, encodes a member of the cytochrome P450 superfamily of
enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many
reactions involved in drug metabolism and synthesis of cholesterol, steroids and
other lipids. This protein localizes to the endoplasmic reticulum and functions as a
19-hydroxylase of prostaglandins in seminal vesicles. This gene is part of a cluster of
cytochrome P450 genes on chromosome 19. Another member of this family,
CYP4F3, is approximately 18 kb away.

CYP8B1: cytochrome P450, subfamily VIIIB (sterol 12-alpha-hydroxylase),
polypeptide 1

This gene encodes a member of the cytochrome P450 superfamily of enzymes. The
cytochrome P450 proteins are monooxygenases which catalyze many reactions
involved in drug metabolism and synthesis of cholesterol, steroids and other lipids.
This endoplasmic reticulum membrane protein catalyzes the conversion of 7
alpha-hydroxy-4-cholesten-3-one into 7-alpha,12-alpha-dihydroxy-4-cholesten-3-one. The
balance between these two steroids determines the relative amounts of cholic acid
and chenodeoxycholic acid both of which are secreted in the bile and affect the
solubility of cholesterol. This gene is unique among the cytochrome P450 genes in
that it is intronless.

DBI: diazepam binding inhibitor (GABA receptor modulator, acyl-Coenzyme A
binding protein)

Diazepam binding inhibitor (acyl-CoA-binding protein); binds and induces
medium-chain acyl-CoA ester synthesis.

DEFA6: defensin, alpha 6, Paneth cell-specific

Defensins are a family of microbicidal and cytotoxic peptides thought to be involved
in host defense. They are abundant in the granules of neutrophils and also found in
the epithelia of mucosal surfaces such as those of the intestine, respiratory tract,
urinary tract, and vagina. Members of the defensin family are highly similar in
protein sequence and distinguished by a conserved cysteine motif. Several alpha
defensin genes appear to be clustered on chromosome 8. The protein encoded by this
gene, defensin, alpha 6, is highly expressed in the secretory granules of Paneth cells
of the small intestine, and likely plays a role in host defense of human bowel.

DEK: DEK oncogene (DNA binding)

Site-specific DNA binding protein; involved in transcriptional regulation and signal
transduction.

DFNA5: deafness, autosomal dominant 5 

Hearing impairment is a heterogeneous condition with over 40 loci described. The
protein encoded by this gene is expressed in fetal cochlea; however, its function is
not known. Nonsyndromic hearing impairment is associated with a mutation in this
gene.

DGKD: diacylglycerol kinase, delta (130kD)

Diacylglycerol kinase delta; phosphorylates the arachidonoyl type of diacylglycerol;
contains a pleckstrin homology domain and an EPH domain.

DOCK1: dedicator of cyto-kinesis 1

Dedicator of cyto-kinesis 1 binds to the SH3 domain of CRK protein. It may regulate
cell surface extension and may have a role in the cell surface extension of an
engulfing cell around a dying cell during apoptosis.

ECE1: endothelin converting enzyme 1

Endothelin converting enzyme; metalloprotease that regulates a peptide involved in
vasoconstriction.

E-Selectin (CD62E)

The endothelial leukocyte adhesion molecule-1 is expressed by cytokine-stimulated
endothelial cells. It is thought to be responsible for the accumulation of blood
leukocytes at sites of inflammation by mediating the adhesion of cells to the vascular
lining. It exhibits structural features such as the presence of lectin- and EGF-like
domains followed by short consensus repeat (SCR) domains that contain 6 conserved
cysteine residues. These proteins are part of the selectin family of cell adhesion
molecules. This gene is present in single copy in the human genome and contains 14
exons spanning about 13 kb of DNA. Adhesion molecules participate in the
interaction between leukocytes and the endothelium and appear to be involved in the
pathogenesis of atherosclerosis.

ESR1: estrogen receptor 1

Estrogen receptor; nuclear receptor transcription factor activated by ligand-binding,
involved in hormone-mediated inhibition of gene expression.

ESR2: estrogen receptor 2 (ER beta) 

Estrogen receptor beta 2; transcriptional activator involved in regulation of
reproduction; exists in five isoforms.

F2: coagulation factor II (thrombin) 

Coagulation factor II is proteolytically cleaved to form thrombin in the first step of
the coagulation cascade which ultimately results in the stemming of blood loss. F2
also plays a role in maintaining vascular integrity during development and postnatal
life. Mutations in F2 lead to various forms of thrombosis and dysprothrombinemia.

F3: coagulation factor III (thromboplastin, tissue factor)

This gene encodes coagulation factor III which is a cell surface glycoprotein. This
factor enables cells to initiate the blood coagulation cascades, and it functions as the
high-affinity receptor for the coagulation factor VII. The resulting complex provides
a catalytic event that is responsible for initiation of the coagulation protease cascades
by specific limited proteolysis. Unlike the other cofactors of these protease cascades,
which circulate as nonfunctional precursors, this factor is a potent initiator that is
fully functional when expressed on cell surfaces. There are 3 distinct domains of this
factor: extracellular, transmembrane, and cytoplasmic. This protein is the only one in
the coagulation pathway for which a congenital deficiency has not been described.

F5: coagulation factor V (proaccelerin, labile factor)

This gene encodes coagulation factor V, which is an essential factor of the blood
coagulation cascade. This factor circulates in plasma, and is converted to the active
form by the release of the activation peptide by thrombin during coagulation. This
generates a heavy chain and a light chain which are held together by calcium ions.
The active factor V is a cofactor that participates with activated coagulation factor X
to activate prothrombin to thrombin. Defects in this gene result in either an
autosomal recessive hemorrhagic diathesis or an autosomal dominant form of
thrombophilia, which is known as activated protein C resistance.

F7: coagulation factor VII (serum prothrombin conversion accelerator)

This gene encodes coagulation factor VII, which is a vitamin K-dependent factor
essential for hemostasis. This factor circulates in the blood in a zymogen form, and is
converted to an active form by either factor IXa, factor Xa, factor XIIa, or thrombin
by minor proteolysis. Upon activation of the factor VII, a heavy chain containing a
catalytic domain and a light chain containing 2 EGF-like domains are generated, and
two chains are held together by a disulfide bond. In the presence of factor III and
calcium ions, the activated factor then further activates the coagulation cascade by
converting factor IX to factor IXa and/or factor X to factor Xa. Alternative splicing
of this gene results in 2 transcripts. Defects in this gene can cause coagulopathy.

F9: coagulation factor IX (plasma thromboplastic component, Christmas
disease, hemophilia B)

This gene encodes vitamin K-dependent coagulation factor IX that circulates in the
blood as an inactive zymogen. This factor is converted to an active form by factor
XIa, which excises the activation peptide and thus generates a heavy chain and a light
chain held together by one or more disulfide bonds. The role of this activated factor
IX in the blood coagulation cascade is to activate factor X to its active form through
interactions with Ca+2 ions, membrane phospholipids, and factor VIII. Alterations of
this gene, including point mutations, insertions and deletions, cause factor IX
deficiency, which is a recessive X-linked disorder, also called hemophilia B or
Christmas disease.

FABP3: fatty acid binding protein 3, muscle and heart (mammary-derived
growth inhibitor)

The intracellular fatty acid-binding proteins (FABPs) belong to a multigene family.
FABPs are divided into at least three distinct types, namely the hepatic-, intestinal-
and cardiac-type. They form 14-15 kDa proteins and are thought to participate in the
uptake, intracellular metabolism and/or transport of long-chain fatty acids. They may
also be responsible for the modulation of cell growth and proliferation. The fatty
acid-binding protein 3 gene contains four exons and its function is to arrest growth of
mammary epithelial cells. This gene is a candidate tumor suppressor gene for human
breast cancer.

FACL3: fatty-acid-Coenzyme A ligase, long-chain 3

The protein encoded by this gene is an isozyme of the long-chain fatty-acid-
coenzyme A ligase family. Although differing in substrate specificity, subcellular
localization, and tissue distribution, all isozymes of this family convert free long-
chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid
biosynthesis and fatty acid degradation. This isozyme is highly expressed in brain,
and preferentially utilizes myristate, arachidonate, and eicosapentaenoate as
substrates. The amino acid sequence of this isozyme is 92% identical to that of rat
homolog.

FACL4: fatty-acid-Coenzyme A ligase, long-chain 4

The protein encoded by this gene is an isozyme of the long-chain fatty-acid-
coenzyme A ligase family. Although differing in substrate specificity, subcellular
localization, and tissue distribution, all isozymes of this family convert free long-
chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid
biosynthesis and fatty acid degradation. This isozyme preferentially utilizes
arachidonate as substrate. The absence of this enzyme may contribute to the mental
retardation or Alport syndrome. Alternative splicing of this gene generates 2
transcript variants.

FMO1: flavin containing monooxygenase 1

Metabolic N-oxidation of the diet-derived amino-trimethylamine (TMA) is mediated
by flavin-containing monooxygenase and is subject to an inherited FMO3
polymorphism in man resulting in a small subpopulation with reduced TMA N-
oxidation capacity resulting in fish odor syndrome (trimethylaminuria). Three forms
of the enzyme, FMO1 found in fetal liver, FMO2 found in adult liver, and FMO3, are
encoded by genes clustered in the 1q23-q25 region. Flavin-containing
monooxygenases are NADPH-dependent flavoenzymes that catalyze the oxidation
of soft nucleophilic heteroatom centers in drugs, pesticides, and xenobiotics.

GAA: glucosidase, alpha; acid (Pompe disease, glycogen storage disease type II)

This gene encodes acid alpha-glucosidase, which is essential for the degradation of 
glycogen to glucose in lysosomes. Different forms of acid alpha-glucosidase are 
obtained by proteolytic processing. Defects in this gene are the cause of glycogen 
storage disease II, also known as Pompe's disease, which is an autosomal recessive
disorder with a broad clinical spectrum.






GAPD: glyceraldehyde-3-phosphate dehydrogenase

Glyceraldehyde-3-phosphate dehydrogenase catalyzes an important energy-yielding
step in carbohydrate metabolism, the reversible oxidative phosphorylation of
glyceraldehyde-3-phosphate in the presence of inorganic phosphate and nicotinamide
adenine dinucleotide (NAD). The enzyme exists as a tetramer of identical chains. A
GAPD pseudogene has been mapped to Xp21-p11 and 15 GAPD-like loci have been
identified.

GARS: glycyl-tRNA synthetase 

Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their 
cognate amino acids. Glycyl-tRNA synthetase is an (alpha)2 dimer which belongs to
the class II family of tRNA synthetases. It has been shown to be a target of
autoantibodies in the human autoimmune diseases, polymyositis or dermatomyositis.

GBE1: glucan (1,4-alpha-), branching enzyme 1 (glycogen branching enzyme,
Andersen disease, glycogen storage disease type IV)

This monomeric enzyme functions in glycogen synthesis by catalyzing the
formation of alpha 1,6-glucosidic linkages. It is most highly expressed in liver and
muscle. Deficiency can result in glycogen storage disease IV (Andersen's disease).

GP6: glycoprotein VI (platelet)

Platelet glycoprotein VI; member of the paired Ig-like receptor family.

GPR*S5

Member of the G protein-coupled receptor family.

GPRC5C: G protein-coupled receptor, family C, group 5, member C

The protein encoded by this gene is a member of the type 3 G protein-coupled
receptor family. Members of this superfamily are characterized by a signature 7-
transmembrane domain motif. The specific function of this protein is unknown;
however, this protein may mediate the cellular effects of retinoic acid on the G
protein signal transduction cascade. Alternative splicing in the 5' UTR of this gene
results in two transcript variants.

3-hydroxy-3-methylglutaryl coenzyme A synthase

3-hydroxy-3-methylglutaryl-Coenzyme A synthase; functions in the first step in
ketogenesis.

HK1: hexokinase 1

Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus
committing glucose to the glycolytic pathway. This gene encodes a ubiquitous form
of hexokinase which localizes to the outer membrane of mitochondria. Mutations in
this gene have been associated with hemolytic anemia due to hexokinase deficiency.
Alternative splicing of this gene results in five transcript variants which encode
different isoforms, some of which are tissue-specific. Each isoform has a distinct N-
terminus; the remainder of the protein is identical among all the isoforms. A sixth
transcript variant has been described, but due to the presence of several stop codons, 
it is not thought to encode a protein. 

HLA-B associated transcript 3 (BAT3)

A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for
TNF alpha and TNF beta. These genes are all within the human major
histocompatibility complex class III region. The protein encoded by this gene is a
nuclear protein. It has been implicated in the control of apoptosis and regulating heat
shock protein. There are three alternatively spliced transcript variants described for
this gene.

HMGCL: 3-hydroxymethyl-3-methylglutaryl-Coenzyme A lyase
(hydroxymethylglutaricaciduria)

3-Hydroxy-3-methylglutaryl coenzyme A lyase; cleaves 3-OH-3-methylglutaryl CoA
to acetoacetic acid and acetyl CoA.

HNF4A: hepatocyte nuclear factor 4, alpha 

Nuclear hormone receptor transcription factor, regulates liver specific gene
expression.

Chromosome 12 BAC RP11-13J12

Cathepsin B

Cathepsin B; lysosomal cysteine (thiol) protease that cleaves APP.

Chromosome 5 clone CTD-2235C13

Chromosome 7 clone RP11-351B12

cytochrome P450 3A locus

The CYP3A locus includes all the known members of the 3A subfamily of the
cytochrome P450 superfamily of genes. These genes encode monooxygenases which
catalyze many reactions involved in drug metabolism and synthesis of cholesterol,
steroids and other lipids. The CYP3A cluster consists of four genes, CYP3A43,
CYP3A4, CYP3A7 and CYP3A5. The region also contains two pseudogenes,
CYP3A5P1 and CYP3A5P2, as well as several extra exons which may or may not be
included in transcripts produced from this region. Previously another CYP3A
member, CYP3A3, was thought to exist; however, it is now thought that this
sequence represents a transcript variant of CYP3A4.

ITGB3 

The ITGB3 protein product is the integrin beta chain beta 3. Integrins are integral
cell-surface proteins composed of an alpha chain and a beta chain. A given chain
may combine with multiple partners resulting in different integrins. Integrin beta 3 is
found along with the alpha IIb chain in platelets. Integrins are known to participate in
cell adhesion as well as cell-surface mediated signalling.

Methionine adenosyltransferase alpha subunit gene fragment 

MAT1A encodes methionine adenosyltransferase I (alpha isoform). MAT1A
catalyzes the formation of S-adenosylmethionine from methionine and ATP. Both
the beta and alpha isoforms may be encoded by MAT1A. Methionine
adenosyltransferase deficiency is known to be caused by recessive as well as dominant
mutations, the latter identified in autosomal dominant persistent
hypermethioninemia.

Homo sapiens PAC clone RP1-102K2 from 22q12.1-qter

Homo sapiens partial ZNF202 gene for zinc finger protein homolog, exon 4

Zinc-finger protein 202 may repress genes involved in lipid metabolism; contains
zinc fingers.




Homo sapiens vHNF1-C mRNA

Hepatocyte Nuclear Factor 1. 

Human 2.5 kb mRNA for cytoskeletal tropomyosin TM30(nm)

Human c-kit gene

KIT encodes the human homolog of the proto-oncogene c-kit. C-kit was first
identified as the cellular homolog of the feline sarcoma viral oncogene v-kit. KIT is a
type 3 transmembrane receptor for MGF (mast cell growth factor, also known as
stem cell factor). Mutations in KIT are associated with gastrointestinal stromal
tumors, mast cell disease, acute myelogenous leukemia, and piebaldism.

Human coagulation factor VII (F7) gene exon 1 and factor X (F10) gene, exon 1

This gene encodes coagulation factor VII, which is a vitamin K-dependent factor
essential for hemostasis. This factor circulates in the blood in a zymogen form, and is
converted to an active form by either factor IXa, factor Xa, factor XIIa, or thrombin
by minor proteolysis. Upon activation of the factor VII, a heavy chain containing a
catalytic domain and a light chain containing 2 EGF-like domains are generated, and
two chains are held together by a disulfide bond. In the presence of factor III and
calcium ions, the activated factor then further activates the coagulation cascade by
converting factor IX to factor IXa and/or factor X to factor Xa. Alternative splicing
of this gene results in 2 transcripts. Defects in this gene can cause coagulopathy.

Human cytochrome P450 (CYP1A2) gene, exons 1 and 2

This gene, CYP1A2, encodes a member of the cytochrome P450 superfamily of
enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many
reactions involved in drug metabolism and synthesis of cholesterol, steroids and
other lipids. The protein encoded by this gene localizes to the endoplasmic reticulum
and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs),
some of which are found in cigarette smoke. The enzyme's endogenous substrate is
unknown; however, it is able to metabolize some PAHs to carcinogenic
intermediates. Other xenobiotic substrates for this enzyme include caffeine, aflatoxin
B1, and acetaminophen. The transcript from this gene contains four Alu sequences
flanked by direct repeats in the 3' untranslated region. A related family member,
CYP1A1, is located approximately 25 kb away from CYP1A2 on chromosome .

Human multidrug resistance-associated protein mRNA 

See ABCC1.

Human succinyl CoA:3-oxoacid CoA transferase precursor (OXCT) mRNA

The mitochondrial matrix enzyme 3-oxoacid CoA transferase is homodimeric. It is a
key enzyme in the extrahepatic utilization of ketone bodies, catalyzing the reversible
transfer of coenzyme A from succinyl-CoA to acetoacetate, a necessary step in
ketolytic energy production. Deficiencies can result in intermittent ketoacidosis.

Human T-lymphoma invasion and metastasis inducing TIAM1 protein
(TIAM1) mRNA

Member of the GDP-GTP exchange factor family of proteins; modulates the activity
of Rho-like proteins; has a Dbl homology and pleckstrin homology domains.

IL10: interleukin 10

Interleukin 10 (cytokine synthesis inhibitory factor); functions as a specific
chemotactic factor for CD8+ T cells.

IL17R: interleukin 17 receptor

Highly similar to murine Il17r, may play a role in T cell activation and induction of
IL-2 (Il2).

IL3: interleukin 3 (colony-stimulating factor, multiple)

Interleukin-3 (colony-stimulating factor); plays a role in hematopoiesis; member of a
family of growth factors.

IL6: interleukin 6 (interferon, beta 2)

Interleukin 6 (interferon-beta 2); induces the maturation of B cells into
immunoglobulin-secreting cells.

IL8RA: interleukin 8 receptor, alpha

Interleukin 8 receptor alpha; G protein-coupled receptor that mediates neutrophil
chemotaxis and binds interleukin 8 (IL8).

INHBC: inhibin, beta C 

This gene encodes the beta C chain of inhibin, a member of the TGF-beta
superfamily. This subunit forms heterodimers with beta A and beta B subunits.
Inhibins and activins, also members of the TGF-beta superfamily, are hormones with
opposing actions and are involved in hypothalamic, pituitary, and gonadal hormone
secretion, as well as growth and differentiation of various cell types.

ITGAL: integrin, alpha L (antigen CD11A (p180), lymphocyte function-
associated antigen 1; alpha polypeptide)

ITGAL encodes the integrin alpha L chain. Integrins are heterodimeric integral
membrane proteins composed of an alpha chain and a beta chain. This I-domain
containing alpha integrin combines with the beta 2 chain (ITGB2) to form the
integrin lymphocyte function-associated antigen-1 (LFA-1), which is expressed on
all leukocytes. LFA-1 plays a central role in leukocyte intercellular adhesion through
interactions with its ligands, ICAMs 1-3 (intercellular adhesion molecules 1 through
3), and also functions in lymphocyte costimulatory signaling. 

ITGB2: integrin, beta 2 (antigen CD18 (p95), lymphocyte function-associated
antigen 1; macrophage antigen 1 (mac-1) beta subunit)

The ITGB2 protein product is the integrin beta chain beta 2. Integrins are integral
cell-surface proteins composed of an alpha chain and a beta chain. A given chain
may combine with multiple partners resulting in different integrins. For example,
beta 2 combines with the alpha L chain to form the integrin LFA-1, and combines
with the alpha M chain to form the integrin Mac-1. Integrins are known to participate
in cell adhesion as well as cell-surface mediated signalling.

KCNQ1: potassium voltage-gated channel, KQT-like subfamily, member 1

KCNQ1 encodes the K+ channel subunit responsible for the delayed-rectifier K+
current in cardiac myocytes. The delayed-rectifier channel is completed by the
protein encoded by KCNE1. Mutations in KCNQ1 cause inherited long-QT
syndrome.

LAMA3: laminin, alpha 3 (nicein (150kD), kalinin (165kD), BM600 (150kD),
epilegrin)

Laminins are basement membrane components thought to mediate the attachment,
migration and organization of cells into tissues during embryonic development by
interacting with other extracellular matrix components. The protein encoded by this
gene is the alpha-3 chain of laminin 5, which is a complex glycoprotein composed of
three subunits (alpha, beta, and gamma). Laminin 5 is thought to be involved in cell
adhesion, signal transduction and differentiation of keratinocytes. Mutations in this
gene have been identified as the cause of Herlitz type junctional epidermolysis
bullosa. Alternative splicing has been observed at this locus but the full-length nature
of these variants has not been determined.

LAMR1: laminin receptor 1 (67kD, ribosomal protein SA)

Laminins, a family of extracellular matrix glycoproteins, are the major non-
collagenous constituent of basement membranes. They have been implicated in a
wide variety of biological processes including cell adhesion, differentiation,
migration, signaling, neurite outgrowth and metastasis. Many of the effects of
laminin are mediated through interactions with cell surface receptors. These receptors
include members of the integrin family, as well as non-integrin laminin-binding
proteins. This gene encodes a high-affinity, non-integrin family, laminin receptor 1.
This receptor has been variously called 67 kD laminin receptor, 37 kD laminin
receptor precursor (37LRP) and p40 ribosome-associated protein. The amino acid
sequence of laminin receptor 1 is highly conserved through evolution, suggesting a
key biological function. It has been observed that the level of the laminin receptor
transcript is higher in colon carcinoma tissue and lung cancer cell line than their
normal counterparts. Also, there is a correlation between the upregulation of this
polypeptide in cancer cells and their invasive and metastatic phenotype. Multiple
copies of this gene exist, however, most of them are pseudogenes thought to have
arisen from retropositional events.

LDLR: low density lipoprotein receptor (familial hypercholesterolemia)

The low density lipoprotein receptor (LDLR) gene family consists of cell surface
proteins involved in receptor-mediated endocytosis of specific ligands. Low density
lipoprotein (LDL) is normally bound at the cell membrane and taken into the cell
ending up in lysosomes where the protein is degraded and the cholesterol is made
available for repression of microsomal enzyme 3-hydroxy-3-methylglutaryl
coenzyme A (HMG CoA) reductase, the rate-limiting step in cholesterol synthesis.
At the same time, a reciprocal stimulation of cholesterol ester synthesis takes place.
Mutations in the LDL receptor (LDLR) gene cause the autosomal dominant disorder,
familial hypercholesterolemia.

LGALS7: lectin, galactoside-binding, soluble, 7 (galectin 7)

The galectins are a family of beta-galactoside-binding proteins implicated in
modulating cell-cell and cell-matrix interactions. Differential and in situ
hybridizations indicate that this lectin is specifically expressed in keratinocytes. It is
expressed at all stages of epidermal differentiation (i.e., in basal and suprabasal
layers). It is moderately repressed by retinoic acid. The protein was found mainly in
stratified squamous epithelium. The antigen localized to basal keratinocytes,
although it was also found, albeit at lower levels, in the suprabasal layers where it
concentrated to areas of cell-to-cell contact. The cellular localization and its striking
down-regulation in cultured keratinocytes imply a role in cell-cell and/or cell-matrix
interactions necessary for normal growth control.

LIMK1: LIM domain kinase 1

There are approximately 40 known eukaryotic LIM proteins, so named for the LIM
domains they contain. LIM domains are highly conserved cysteine-rich structures
containing 2 zinc fingers. Although zinc fingers usually function by binding to DNA
or RNA, the LIM motif probably mediates protein-protein interactions. LIM kinase-1
and LIM kinase-2 belong to a small subfamily with a unique combination of 2 N-
terminal LIM motifs and a C-terminal protein kinase domain. LIMK1 is likely to be a
component of an intracellular signaling pathway and may be involved in brain
development. LIMK1 hemizygosity is implicated in the impaired visuospatial
constructive cognition of Williams syndrome. Two splice variants have been
identified.

LMNB2: lamin B2

Lamin B2; member of a family of structural nuclear envelope proteins.

LPL: lipoprotein lipase

LPL encodes lipoprotein lipase, which is expressed in heart, muscle, and adipose
tissue. LPL functions as a homodimer, and has the dual functions of triglyceride
hydrolase and ligand/bridging factor for receptor-mediated lipoprotein uptake. Severe
mutations that cause LPL deficiency result in type I hyperlipoproteinemia, while less
extreme mutations in LPL are linked to many disorders of lipoprotein metabolism.

LRP8: low density lipoprotein receptor-related protein 8, apolipoprotein e 
receptor 

This gene encodes an apolipoprotein E receptor, a member of the low density
lipoprotein receptor (LDLR) family. Apolipoprotein E is a small lipophilic plasma
protein and a component of lipoproteins such as chylomicron remnants, very low
density lipoprotein (VLDL), and high density lipoprotein (HDL). The apolipoprotein
E receptor is involved in cellular recognition and internalization of these lipoproteins.
Alternative splicing generates three transcript variants for this gene; additional
variants have been described, but their full length nature has not been determined.

LSS: lanosterol synthase (2,3-oxidosqualene-lanosterol cyclase)

Lanosterol synthase ((S)-2,3-epoxysqualene mutase); catalyzes the cyclization of (S)-
2,3-oxidosqualene; forms lanosterol during sterol biosynthesis.

LTA: lymphotoxin alpha (TNF superfamily, member 1) 

Lymphotoxin alpha, a member of the tumor necrosis factor family, is a cytokine
produced by lymphocytes. LTA is highly inducible, secreted, and exists as
homotrimeric molecule. LTA forms heterotrimers with lymphotoxin-beta which
anchors lymphotoxin-alpha to the cell surface. LTA mediates a large variety of
inflammatory, immunostimulatory, and antiviral responses. LTA is also involved in
the formation of secondary lymphoid organs during development and plays a role in
apoptosis.

MAOA: monoamine oxidase A 

MAOA encodes monoamine oxidase A, an enzyme that degrades amine
neurotransmitters, such as dopamine, norepinephrine, and serotonin. Deficiency of this
enzyme results in Brunner syndrome. 

MARCKS: myristoylated alanine-rich protein kinase C substrate 

The protein encoded by this gene is a substrate for protein kinase C. It is localized to 
the plasma membrane and is an actin filament crosslinking protein. Phosphorylation
by protein kinase C or binding to calcium-calmodulin inhibits its association with
actin and with the plasma membrane, leading to its presence in the cytoplasm. The
protein is thought to be involved in cell motility, phagocytosis, membrane trafficking
and mitogenesis.

MCL1: myeloid cell leukemia sequence 1 (BCL2-related)

Similar to BCL2.

MCP: membrane cofactor protein (CD46, trophoblast-lymphocyte cross-
reactive antigen)

Membrane cofactor protein; acts as the receptor for the measles virus, may be
involved in the regulation of complement activation; contains SCRs.

METTL1: methyltransferase-like 1

This gene is an ortholog of the S. cerevisiae YDL201w gene, which is predicted to
encode a methyltransferase. The gene product contains a conserved S-adenosyl-
methionine-binding motif, which is typical of a methyltransferase. Alternative splice
variants encoding different protein isoforms and transcript variants utilizing
alternative polyA sites have been described in the literature.

MLLT3: myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila);

Serine and proline rich protein, has a nuclear targeting sequence.

MTHFD1: methylenetetrahydrofolate dehydrogenase (NADP+ dependent),
methenyltetrahydrofolate cyclohydrolase, formyltetrahydrofolate synthetase

This gene encodes a protein that possesses three distinct enzymatic activities, 5,10-
methylenetetrahydrofolate dehydrogenase, 5,10-methenyltetrahydrofolate cyclo-
hydrolase and 10-formyltetrahydrofolate synthetase. Each of these activities
catalyzes one of three sequential reactions in the interconversion of 1-carbon
derivatives of tetrahydrofolate, which are substrates for methionine, thymidylate, and
de novo purine syntheses. The trifunctional enzymatic activities are conferred by two
major domains, an aminoterminal portion containing the dehydrogenase and 
cyclohydrolase activities and a larger synthetase domain. 

MTMR2: myotubularin related protein 2 (MTMR2)

This gene is a member of the myotubularin family and encodes a putative tyrosine
phosphatase. Mutations in this gene are a cause of Charcot-Marie-Tooth disease type
4B, an autosomal recessive demyelinating neuropathy. This gene utilizes multiple
polyA signals, only one of which has been determined.

Muscle specific serine kinase (MSSK1; serine/threonine kinase 23, STK23)

Highly similar to SRPK2; may be protein kinase for SR family of RNA splicing
factors; contains a kinase domain.

MVD: mevalonate (diphospho) decarboxylase

The enzyme mevalonate pyrophosphate decarboxylase catalyzes the conversion of
mevalonate pyrophosphate into isopentenyl pyrophosphate in one of the early steps
in cholesterol biosynthesis. It decarboxylates and dehydrates its substrate while
hydrolyzing ATP.

MYH11: myosin, heavy polypeptide 11, smooth muscle

The protein encoded by this gene is a smooth muscle myosin belonging to the
myosin heavy chain family. The gene product is a subunit of a hexameric protein that
consists of 2 heavy chain subunits and 2 pairs of non-identical light chain subunits. It
functions as a major contractile protein, converting chemical energy into mechanical
energy through the hydrolysis of ATP. The gene encoding a human ortholog of rat
NUDE1 is transcribed from the reverse strand of MYH11 gene, and its 3' end
overlaps with that of the latter. The pericentric inversion of chromosome 16
[inv(16)(p13q22)] produces a chimeric transcript consisting of the first 165 residues
from the N terminus of core-binding factor beta in a fusion with the C-terminal
portion of the smooth muscle myosin heavy chain. This chromosomal rearrangement
is associated with acute myeloid leukemia of the M4Eo subtype. Alternative splicing
generates isoforms that are differentially expressed, with ratios changing during
muscle cell maturation. Additional splice variants have been described but their full-
length nature has not been determined.

MYH7: myosin, heavy polypeptide 7, cardiac muscle, beta 

MYH7 encodes the cardiac muscle beta (or slow) isoform of myosin. Changes in the
relative abundance of MYH7 and MYH6 (the alpha, or fast, isoform of cardiac
myosin heavy chain) correlate with the contractile velocity of cardiac muscle.
Mutations in MYH7 are associated with familial hypertrophic cardiomyopathy.

NADH dehydrogenase (ubiquinone) 1, alpha subcomplex, 4 (9kD, MLRQ), 
NDUFA4 

Subunit of NADH-ubiquinone oxidoreductase (complex I); transports electrons from
NADH to ubiquinone.

NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 (EC 1.6.5.3).

Subunit of NADH-ubiquinone oxidoreductase (complex I); transports electrons from
NADH to ubiquinone.

NDUFA9: NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 9 (39kD)

NGFB: nerve growth factor, beta polypeptide

Nerve growth factor beta; has roles in neuronal differentiation and survival.

NGFR: nerve growth factor receptor (TNFR superfamily, member 16) 

Nerve growth factor receptor contains an extracellular domain containing four 40-
amino acid repeats with 6 cysteine residues at conserved positions followed by a
serine/threonine-rich region, a single transmembrane domain, and a 155-amino acid
cytoplasmic domain. The cysteine-rich region contains the nerve growth factor
binding domain.

NID2: nidogen 2

Nidogen-2; basement membrane protein.

HSU15552: acidic 82 kDa protein mRNA

Nonmuscle type myosin heavy chain 9 (MYH9)

Non-muscle myosin heavy chain 9; motor protein that provides force for muscle
contraction, cytokinesis and phagocytosis; contains an ATPase head domain and a
rod-like tail domain.

NPC1: Niemann-Pick disease, type C1

NPC1 was identified as the gene that, when mutated, results in Niemann-Pick C
disease. NPC1 encodes a putative integral membrane protein containing motifs
consistent with a role in intracellular transport of cholesterol to post-lysosomal
destinations.

Nth endonuclease III-like 1 (NTHL1)

Endonuclease; excises damaged pyrimidines.

NUCB2: nucleobindin 2

Nucleobindin 2; may bind DNA and calcium; has DNA-binding and EF-hand
domains, and a leucine-zipper.

nuclear receptor subfamily 1, group I, member 2 (NR1I2)

The gene product belongs to the nuclear receptor superfamily, members of which are
transcription factors characterized by a ligand-binding domain and a DNA-binding
domain. The encoded protein is a transcriptional regulator of the cytochrome P450
gene CYP3A4, binding to the response element of the CYP3A4 promoter as a
heterodimer with the 9-cis retinoic acid receptor RXR. It is activated by a range of
compounds that induce CYP3A4, including dexamethasone and rifampicin. The gene
product contains a zinc finger domain. Three alternatively spliced transcripts that
encode different isoforms have been described, one of which encodes two products
through the use of alternative translation initiation codons. Additional transcript
variants derived from alternative promoter usage, alternative splicing, and/or
alternative polyadenylation exist, but they have not been fully described.

OGDH: oxoglutarate (alpha-ketoglutarate) dehydrogenase (lipoamide)

Alpha-ketoglutarate or 2-oxoglutarate dehydrogenase; helps convert alpha-ketoglutarate
to succinyl coenzyme A in Krebs cycle.

OXCT: 3-oxoacid CoA transferase

The mitochondrial matrix enzyme 3-oxoacid CoA transferase is homodimeric. It is a
key enzyme in the extrahepatic utilization of ketone bodies, catalyzing the reversible
transfer of coenzyme A from succinyl-CoA to acetoacetate, a necessary step in
ketolytic energy production. Deficiencies can result in intermittent ketoacidosis.

P2RY1: purinergic receptor P2Y, G-protein coupled, 1 

Purinergic receptor P2Y1, a G protein-coupled receptor; mediates responses to ATP 
and increases inositol phosphate levels. 

PCCA: propionyl Coenzyme A carboxylase, alpha polypeptide

PCCA encodes the alpha subunit of the heterodimeric mitochondrial enzyme
propionyl-CoA carboxylase. PCCA encodes the biotin-binding region of this
enzyme. Mutations in either PCCA or PCCB (encoding the beta subunit) lead to an
enzyme deficiency resulting in propionic acidemia.

PDGFB: platelet-derived growth factor beta polypeptide (simian sarcoma viral 
(v-sis) oncogene homolog) 

The protein encoded by this gene is a member of the platelet-derived growth factor
family. The four members of this family are mitogenic factors for cells of
mesenchymal origin and are characterized by a motif of eight cysteines. This gene
product can exist either as a homodimer or as a heterodimer with the platelet-derived
growth factor alpha polypeptide, where the dimers are connected by disulfide bonds.
Mutations in this gene are associated with meningioma. Reciprocal translocations
between chromosomes 22 and 7, at sites where this gene and that for COL1A1 are
located, are associated with a particular type of skin tumor called
dermatofibrosarcoma protuberans resulting from unregulated expression of growth
factor. Two splice variants have been identified for this gene.

PERIOD CIRCADIAN PROTEIN 2 (KIAA0347)

This gene is a member of the Period family of genes and is expressed in a circadian
pattern in the suprachiasmatic nucleus, the primary circadian pacemaker in the
mammalian brain. Genes in this family encode components of the circadian rhythms
of locomotor activity, metabolism, and behavior. Circadian expression in the
suprachiasmatic nucleus continues in constant darkness, and a shift in the light/dark
cycle evokes a proportional shift of gene expression in the suprachiasmatic nucleus.
The specific function of this gene is not yet known.

Peroxisome proliferative activated receptor, delta (PPARD)

Peroxisome proliferator-activated receptor delta is a member of the steroid hormone
receptor superfamily.

PGM5: phosphoglucomutase 5

Phosphoglucomutase-related (aciculin) putative structural protein; interacts with the
cytoskeletal proteins dystrophin and utrophin.

PLA2G3: phospholipase A2, group III

Group III secreted phospholipase A2; calcium-dependent, displays a preference for
phosphatidylglycerol over phosphatidylcholine.

PLA2G4C: phospholipase A2, group IVC (cytosolic, calcium-independent)

Group IVC calcium-independent phospholipase A2; hydrolyzes the phospholipid sn-2
ester bond; member of the phospholipase family.

PLA2G6: phospholipase A2, group VI (cytosolic, calcium-independent)

Cytosolic calcium-independent phospholipase A2; hydrolyzes the phospholipid sn-2
ester bond; member of the phospholipase family.

PMVK: phosphomevalonate kinase 

Phosphomevalonate kinase; converts mevalonate-5-phosphate to
mevalonate-5-diphosphate.

PNMT: phenylethanolamine N-methyltransferase

Phenylethanolamine N-methyltransferase; converts norepinephrine to epinephrine.

PON1: paraoxonase 1

PON2: paraoxonase 2

Paraoxonase/arylesterase 2; possibly functions in protecting low density lipoprotein 
against oxidative modification; member of a family that hydrolyzes toxic organo- 
phosphates. 

PPARA: peroxisome proliferative activated receptor, alpha

Peroxisome proliferators are a diverse group of chemicals which include
hypolipidemic drugs, herbicides, leukotriene antagonists, and plasticizers, and are so
called because they induce an increase in the size and number of peroxisomes.
Peroxisomes are subcellular organelles found in plants and animals, and contain
enzymes for respiration, cholesterol and lipid metabolism. In fact, the fibrate class of
hypolipidemic drugs is used to reduce triglycerides and cholesterol in patients with
hyperlipidemia, a major risk factor for coronary heart disease. The action of
peroxisome proliferators is thought to be mediated via specific receptors belonging to
the steroid hormone receptor superfamily, called PPARs. Thus far, four closely
related subtypes, alpha, beta, gamma and delta, have been identified. The subtype
PPAR-alpha, encoded by PPARA, is a nuclear transcription factor. Upon activation
by peroxisome proliferators, it modulates the expression of target genes involved in
lipid metabolism, suggesting a role for PPAR-alpha in lipid homeostasis.

PPARG: peroxisome proliferative activated receptor, gamma 

The protein encoded by this gene is a member of the peroxisome
proliferator-activated receptor (PPAR) subfamily of nuclear receptors. PPARs form
heterodimers with retinoid X receptors (RXRs) and these heterodimers regulate
transcription of various genes. Three subtypes of PPARs are known: PPAR-alpha,
PPAR-delta, and PPAR-gamma. The protein encoded by this gene is PPAR-gamma and
is a regulator of adipocyte differentiation. Additionally, PPAR-gamma has been
implicated in the pathology of numerous diseases including obesity, diabetes,
atherosclerosis, and cancer. Multiple transcript variants that use alternate promoters
and splicing have been identified for this gene. At least three of these variants
encode the same isoform.

PPM1A: protein phosphatase 1A (formerly 2C), magnesium-dependent, alpha
isoform

Magnesium- or manganese-dependent alpha protein phosphatase 1A; regulates cell
stress responses.

PROBABLE G PROTEIN-COUPLED RECEPTOR APJ

PTPRA: protein tyrosine phosphatase, receptor type, A

The protein encoded by this gene is a member of the protein tyrosine phosphatase
(PTP) family. PTPs are known to be signaling molecules that regulate a variety of
cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic
transformation. This PTP contains an extracellular domain, a single transmembrane
segment and two tandem intracytoplasmic catalytic domains, and thus represents a
receptor-type PTP. This PTP has been shown to dephosphorylate and activate Src
family tyrosine kinases, and is implicated in the regulation of integrin signaling, cell
adhesion and proliferation. Three alternatively spliced variants of this gene, which
encode two distinct isoforms, have been reported.

PYGM: phosphorylase, glycogen; muscle (McArdle syndrome, glycogen storage
disease type V)

Muscle glycogen phosphorylase.

RTN1: reticulon 1

RXRA: retinoid X receptor, alpha 

Retinoid X receptors (RXRs) and retinoic acid receptors (RARs) are nuclear
receptors that mediate the biological effects of retinoids by their involvement in
retinoic acid-mediated gene activation. These receptors exert their action by binding,
as homodimers or heterodimers, to specific sequences in the promoters of target
genes and regulating their transcription. The protein encoded by this gene is a
member of the steroid and thyroid hormone receptor superfamily of transcriptional
regulators.

RXRB: retinoid X receptor, beta

Retinoid X receptor beta; binds to and serves as transcriptional coactivator for
retinoic acid.

SCA1: spinocerebellar ataxia 1 (olivopontocerebellar ataxia 1, autosomal
dominant, ataxin 1)

The autosomal dominant cerebellar ataxias (ADCA) are a heterogeneous group of
neurodegenerative disorders characterized by progressive degeneration of the
cerebellum, brain stem and spinal cord. Clinically, ADCA has been divided into
three groups: ADCA types I-III. ADCAI is genetically heterogeneous, with five
genetic loci, designated spinocerebellar ataxia (SCA) 1, 2, 3, 4 and 6, being assigned
to five different chromosomes. ADCAII, which always presents with retinal
degeneration (SCA7), and ADCAIII, often referred to as the 'pure' cerebellar
syndrome (SCA5), are most likely homogeneous disorders. Several SCA genes have
been cloned and shown to contain CAG repeats in their coding regions. ADCA is
caused by the expansion of the CAG repeats, producing an elongated polyglutamine
tract in the corresponding protein. The expanded repeats are variable in size and
unstable, usually increasing in size when transmitted to successive generations. The
function of the ataxins is not known. The SCA1 locus has been mapped to
chromosome 6, and it has been determined that the diseased allele contains 41-81
CAG repeats, compared to 6-39 in the normal allele. Several transcript variants of
SCA1 in the 5' UTR have been described; however, their full-length nature is not
known.
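
The repeat ranges quoted above for SCA1 can be illustrated with a minimal Python
sketch. The function names and the returned labels are hypothetical and are not
part of the specification; only the two ranges (41-81 and 6-39 CAG repeats) come
from the text. The sketch counts uninterrupted CAG units only and does not model
interrupted repeat tracts.

```python
import re


def longest_cag_run(sequence: str) -> int:
    """Return the number of CAG units in the longest uninterrupted CAG run."""
    runs = re.findall(r"(?:CAG)+", sequence.upper())
    return max((len(run) // 3 for run in runs), default=0)


def classify_sca1_allele(repeat_count: int) -> str:
    """Label a repeat count using only the ranges stated in the text."""
    if 41 <= repeat_count <= 81:
        return "expanded"   # range given for the diseased allele
    if 6 <= repeat_count <= 39:
        return "normal"     # range given for the normal allele
    return "outside the ranges given in the text"


if __name__ == "__main__":
    example = "AT" + "CAG" * 45 + "GG"   # made-up sequence for illustration
    count = longest_cag_run(example)
    print(count, classify_sca1_allele(count))
```

A count that falls between or outside the two ranges is reported as such rather
than forced into either class.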

SDF1: stromal cell-derived factor 1

Stromal cell-derived factor 1; lymphocyte chemoattractant that signals through the
receptor CXCR4.

SERPINA5: serine (or cysteine) proteinase inhibitor, clade A (alpha-1
antiproteinase, antitrypsin), member 5

Protein C inhibitor (plasminogen activator inhibitor III); may be a serine protease
inhibitor; member of the serpin family of serine protease inhibitors.

SERPINH1: serine (or cysteine) proteinase inhibitor, clade H (heat shock
protein 47), member 1, (collagen binding protein 1)

Colligin; collagen-binding protein; Similar to HSPs and to serpin family serine 
protease inhibitors. 

SLC21A6: solute carrier family 21 (organic anion transporter), member 6 

Organic anion transporter.

SLC27A1: solute carrier family 27 (fatty acid transporter), member 1

SULT1A2: sulfotransferase family, cytosolic, 1A, phenol-preferring, member 2

Phenol-metabolizing sulfotransferase 2; sulfonates simple planar phenols.

THBS3: thrombospondin 3

Thrombospondin 3 binds heparin and calcium; similar to murine Thbs3.

TBP: TATA box binding protein

TATA box binding protein, component of the TFIID complex; functions in the
initiation of mRNA synthesis and basal transcription.




TBXA2R: thromboxane A2 receptor 

Thromboxane A2 receptor (prostaglandin H2 receptor); G protein-coupled receptor,
activates Ca2+-activated chloride channels; stimulates platelet aggregation and
smooth muscle constriction.

TCF2: transcription factor 2, hepatic; LF-B3; variant hepatic nuclear factor

TCF2 encodes transcription factor 2, a liver-specific factor of the homeobox-
containing basic helix-turn-helix family. The TCF2 protein is believed to form
heterodimers with another liver-specific member of this transcription factor family,
TCF1; depending on the TCF2 isoform, the result may be to activate or inhibit
transcription of target genes. Mutation of TCF2 that disrupts normal function has
been identified as the cause of MODY5 (Maturity-Onset of Diabetes, Type 5). A
third human transcript variant is believed to exist based on such a variant in the rat;
however, to date such an mRNA species has not been isolated.

TETRAN: tetracycline transporter-like protein 

Similar to E. coli tetracycline resistance efflux protein.

TGFB1: transforming growth factor, beta 1 (Camurati-Engelmann disease)

Transforming growth factor-beta 1; regulates cell proliferation, differentiation, and 
apoptosis. 

TGFB2: transforming growth factor, beta 2 

Transforming growth factor-beta 2 (glioblastoma-derived T cell suppressor factor);
suppresses IL2-dependent growth of T cells; member of a family of cytokines that
transmits signals through transmembrane serine/threonine kinases.

TGFB3: transforming growth factor, beta 3

Transforming growth factor-beta 3; transmits signals through transmembrane
serine/threonine kinases, may be required for normal development of the lung and
palate; member of family of cytokines, very strongly similar to murine Tgfb3.

THPO: thrombopoietin (myeloproliferative leukemia virus oncogene ligand,
megakaryocyte growth and development factor)

Thrombopoietin; binds to c-Mpl receptor and regulates megakaryocyte development.

TNFAIP2: tumor necrosis factor, alpha-induced protein 2 

Secreted by vascular endothelium; expression is induced by tumor necrosis factor
alpha, interleukin-1 beta, and lipopolysaccharide.

TRAP1: heat shock protein 75

Heat shock protein 75; binds and refolds denatured RB1 during M phase and after
heat shock; member of the HSP90 family of molecular chaperones.

TRIP10: thyroid hormone receptor interactor 10

Similar to the non-kinase domains of FER and Fes/Fps tyrosine kinases; binds to
activated Cdc42 and may regulate actin cytoskeleton; contains an SH3 domain.

TXN: thioredoxin

Thioredoxin; has dithiol-disulfide oxidoreductase activity.

USP6: ubiquitin specific protease 6 (Tre-2 oncogene)

Ubiquitin specific protease 6 (Tre-2 oncogene); cleaves ubiquitin from proteins, has
predicted nucleic acid-binding properties.

UTRN: utrophin (homologous to dystrophin) 

This gene shares both structural and functional similarities with the dystrophin gene.
It contains an actin-binding N-terminus, a triple coiled-coil repeat central region, and
a C-terminus that consists of protein-protein interaction motifs which interact with
dystroglycan protein components. The protein encoded by this gene is located at the
neuromuscular synapse and myotendinous junctions, where it participates in
post-synaptic membrane maintenance and acetylcholine receptor clustering. Mouse
studies suggest that this gene may serve as a functional substitute for the dystrophin
gene and therefore may serve as a potential therapeutic alternative for muscular
dystrophy, which is caused by mutations in the dystrophin gene. Alternative splicing of
the utrophin gene has been described; however, the full-length nature of these
variants has not yet been determined.

VEGF: vascular endothelial growth factor

Vascular endothelial growth factor; induces endothelial cell proliferation and
vascular permeability.

VEGFB: vascular endothelial growth factor B

Vascular endothelial growth factor B; involved in angiogenesis and endothelial cell
growth.

WISP1: WNT1 inducible signaling pathway protein 1

This gene encodes a member of the WNT1 inducible signaling pathway (WISP)
protein subfamily, which belongs to the connective tissue growth factor (CTGF)
family. WNT1 is a member of a family of cysteine-rich, glycosylated signaling
proteins that mediate diverse developmental processes. The CTGF family members
are characterized by four conserved cysteine-rich domains: insulin-like growth
factor-binding domain, von Willebrand factor type C module, thrombospondin
domain and C-terminal cystine knot-like domain. This gene may be downstream in
the WNT1 signaling pathway that is relevant to malignant transformation. It is
expressed at a high level in fibroblast cells, and overexpressed in colon tumors. The
encoded protein binds to decorin and biglycan, two members of a family of small
leucine-rich proteoglycans present in the extracellular matrix of connective tissue,
and possibly prevents the inhibitory activity of decorin and biglycan in tumor cell
proliferation. It also attenuates p53-mediated apoptosis in response to DNA damage
through activation of the Akt kinase. It is 83% identical to the mouse protein at the
amino acid level. Alternative splicing of this gene generates 2 transcript variants.

XDH: xanthine dehydrogenase

Xanthine dehydrogenase belongs to the group of molybdenum-containing
hydroxylases involved in the oxidative metabolism of purines. The enzyme is a
homodimer. Xanthine dehydrogenase can be converted to xanthine oxidase by
reversible sulfhydryl oxidation or by irreversible proteolytic modification. Defects in
xanthine dehydrogenase cause xanthinuria, may contribute to adult respiratory stress
syndrome, and may potentiate influenza infection through an oxygen metabolite-
dependent mechanism.

YAP1: Yes-associated protein 1, 65 kD

Yes-associated protein; binds to the proto-oncoprotein Yes; has a WW domain.

PROCR: protein C receptor, endothelial (EPCR)

Endothelial protein C receptor; binds protein C in a calcium-dependent manner;
member of the CD1/major histocompatibility complex superfamily.

STX1A: syntaxin 1A (brain)

Syntaxin 1A (brain); involved in intracellular transport and neurotransmitter release.

As SNPs are linked to other SNPs in neighboring genes on a chromosome (linkage
disequilibrium), those SNPs could also be used as marker SNPs. In a recent
publication it was shown that SNPs are linked over 100 kb, in some cases more than
150 kb (Reich D.E. et al. Nature 411, 199-204, 2001). Hence SNPs lying in regions
neighbouring PA SNPs could be linked to the latter and thereby serve as diagnostic
markers. These associations could be performed as described for the gene
polymorphism in methods.
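
As a hedged illustration of how linkage between two SNPs might be quantified, the
following Python sketch computes the squared correlation coefficient r² from phased
haplotypes. The text itself does not prescribe a particular linkage measure; r² is
assumed here as one commonly used statistic, and the function name and the input
layout (one allele pair per chromosome) are hypothetical.

```python
def ld_r_squared(haplotypes):
    """Compute r^2 between two biallelic SNPs.

    haplotypes: list of (allele_at_snp1, allele_at_snp2) tuples,
    one tuple per phased chromosome.
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("at least one haplotype is required")

    # Use the alleles of the first haplotype as the reference alleles.
    ref_a, ref_b = haplotypes[0]
    p_a = sum(1 for a, _ in haplotypes if a == ref_a) / n
    p_b = sum(1 for _, b in haplotypes if b == ref_b) / n
    p_ab = sum(1 for a, b in haplotypes if a == ref_a and b == ref_b) / n

    d = p_ab - p_a * p_b                      # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0                            # one SNP is monomorphic here
    return d * d / denom


if __name__ == "__main__":
    linked = [("A", "G")] * 5 + [("C", "T")] * 5
    unlinked = [("A", "G"), ("A", "T"), ("C", "G"), ("C", "T")]
    print(ld_r_squared(linked), ld_r_squared(unlinked))
```

Values near 1 indicate that one SNP is a good proxy for the other, which is the
property a marker SNP relies on; values near 0 indicate independence.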

Definitions 

For convenience, the meanings of certain terms and phrases employed in the
specification, examples, and appended claims are provided below. Moreover, the
definitions themselves are intended to explain a further background of the invention.

The term "allele", which is used interchangeably herein with "allelic variant", refers
to alternative forms of a gene or portions thereof. Alleles occupy the same locus or
position on homologous chromosomes. When a subject has two identical alleles of a
gene, the subject is said to be homozygous for the gene or allele. When a subject has
two different alleles of a gene, the subject is said to be heterozygous for the gene.
Alleles of a specific gene can differ from each other in a single nucleotide, or several
nucleotides, and can include substitutions, deletions, and insertions of nucleotides.
An allele of a gene can also be a form of a gene containing a mutation.

The term "allelic variant of a polymorphic region of a gene" refers to a region of a
gene having one of several nucleotide sequences found in that region of the gene in
other individuals.

"Homology" or "identity" or "similarity" refers to sequence similarity between two
peptides or between two nucleic acid molecules. Homology can be determined by
comparing a position in each sequence which may be aligned for purposes of
comparison. When a position in the compared sequence is occupied by the same base
or amino acid, then the molecules are homologous at that position. A degree of
homology between sequences is a function of the number of matching or
homologous positions shared by the sequences. An "unrelated" or "non-homologous"
sequence shares less than 40% identity, though preferably less than 25% identity,
with one of the sequences of the present invention.
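
The position-by-position comparison described above can be sketched in Python as
follows. This is a minimal sketch that assumes the two sequences have already been
aligned to equal length, with "-" marking gaps; the function names and the handling
of gap positions (skipped) are assumptions of this example, not requirements of the
definition. The 40% threshold is the one stated in the text.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percentage of aligned, non-gap positions carrying the same residue."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    compared = 0
    matches = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x == gap or y == gap:
            continue                  # gap positions are not compared
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def is_unrelated(identity_percent: float, threshold: float = 40.0) -> bool:
    """Apply the 'less than 40% identity' criterion from the definition."""
    return identity_percent < threshold


if __name__ == "__main__":
    pid = percent_identity("ACGTACGT", "ACGTTCGT")
    print(pid, is_unrelated(pid))
```

Passing `threshold=25.0` applies the stricter, preferred criterion mentioned in the
same definition.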

The term "a homologue of a nucleic acid" refers to a nucleic acid having a nucleotide
sequence having a certain degree of homology with the nucleotide sequence of the
nucleic acid or complement thereof. A homologue of a double stranded nucleic acid
having SEQ ID NO. X is intended to include nucleic acids having a nucleotide
sequence which has a certain degree of homology with SEQ ID NO. X or with the
complement thereof. Preferred homologues of nucleic acids are capable of
hybridizing to the nucleic acid or complement thereof.

The term "interact" as used herein is meant to include detectable interactions between
molecules, such as can be detected using, for example, a hybridization assay.
The term interact is also meant to include "binding" interactions between molecules.
Interactions may be, for example, protein-protein, protein-nucleic acid, protein-small
molecule or small molecule-nucleic acid in nature.

The term "intronic sequence" or "intronic nucleotide sequence" refers to the
nucleotide sequence of an intron or portion thereof.

The term "isolated" as used herein with respect to nucleic acids, such as DNA or
RNA, refers to molecules separated from other DNAs or RNAs, respectively, that are
present in the natural source of the macromolecule. The term isolated as used herein
also refers to a nucleic acid or peptide that is substantially free of cellular material,
viral material, or culture medium when produced by recombinant DNA techniques,
or chemical precursors or other chemicals when chemically synthesized.

Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments
which are not naturally occurring as fragments and would not be found in the natural
state. The term "isolated" is also used herein to refer to polypeptides which are
isolated from other cellular proteins and is meant to encompass both purified and
recombinant polypeptides.

The term "lipid" shall refer to a fat or fat-like substance that is insoluble in polar
solvents such as water. The term "lipid" is intended to include true fats (e.g. esters of
fatty acids and glycerol); lipids (phospholipids, cerebrosides, waxes); sterols
(cholesterol, ergosterol) and lipoproteins (e.g. HDL, LDL and VLDL).

The term "locus" refers to a specific position in a chromosome. For example, a locus 
of a gene refers to the chromosomal position of the gene. 

The term "modulation" as used herein refers to both up-regulation (i.e., activation or
stimulation), for example by agonizing, and down-regulation (i.e., inhibition or
suppression), for example by antagonizing, of a bioactivity (e.g. expression of a
gene).

The term "molecular structure" of a gene or a portion thereof refers to the structure as
defined by the nucleotide content (including deletions, substitutions, additions of one
or more nucleotides), the nucleotide sequence, the state of methylation, and/or any
other modification of the gene or portion thereof.

The term "mutated gene" refers to an allelic form of a gene, which is capable of
altering the phenotype of a subject having the mutated gene relative to a subject
which does not have the mutated gene. If a subject must be homozygous for this
mutation to have an altered phenotype, the mutation is said to be recessive. If one
copy of the mutated gene is sufficient to alter the genotype of the subject, the
mutation is said to be dominant. If a subject has one copy of the mutated gene and
has a phenotype that is intermediate between that of a homozygous and that of a
heterozygous (for that gene) subject, the mutation is said to be co-dominant.

As used herein, the term "nucleic acid" refers to polynucleotides such as
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term
should also be understood to include, as equivalents, derivatives, variants and
analogs of either RNA or DNA made from nucleotide analogs, including peptide
nucleic acids (PNA), morpholino oligonucleotides (J. Summerton and D. Weller,
Antisense and Nucleic Acid Drug Development 7:187 (1997)) and, as applicable to
the embodiment being described, single (sense or antisense) and double-stranded
polynucleotides. Deoxyribonucleotides include deoxyadenosine, deoxycytidine,
deoxyguanosine, and deoxythymidine. For purposes of clarity, when referring herein
to a nucleotide of a nucleic acid, which can be DNA or an RNA, the terms
"adenosine", "cytidine", "guanosine", and "thymidine" are used. It is understood that
if the nucleic acid is RNA, a nucleotide having a uracil base is uridine.

The term "nucleotide sequence complementary to the nucleotide sequence set forth in
SEQ ID NO. x" refers to the nucleotide sequence of the complementary strand of a
nucleic acid strand having SEQ ID NO. x. The term "complementary strand" is used
herein interchangeably with the term "complement". The complement of a nucleic
acid strand can be the complement of a coding strand or the complement of a non-
coding strand. When referring to double stranded nucleic acids, the complement of a
nucleic acid having SEQ ID NO. x refers to the complementary strand of the strand
having SEQ ID NO. x or to any nucleic acid having the nucleotide sequence of the
complementary strand of SEQ ID NO. x. When referring to a single stranded nucleic
acid having the nucleotide sequence SEQ ID NO. x, the complement of this nucleic
acid is a nucleic acid having a nucleotide sequence which is complementary to that of
SEQ ID NO. x. The nucleotide sequences and complementary sequences thereof are
always given in the 5' to 3' direction. The terms "complement" and "reverse
complement" are used interchangeably herein.
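
Because complementary sequences are always written 5' to 3', the complement of a
strand is obtained by complementing each base and reversing the result. A minimal
Python sketch of this operation for unambiguous DNA bases follows; the function name
is hypothetical and ambiguity codes are deliberately not handled.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the complementary strand, written in the 5' to 3' direction."""
    return sequence.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGC"))  # 5'-ATGC-3' pairs with 5'-GCAT-3'
```

Applying the function twice returns the original sequence, which is a convenient
sanity check.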

The term "operably linked" is intended to mean that the promoter is associated with
the nucleic acid in such a manner as to facilitate transcription of the nucleic acid.

The term "polymorphism" refers to the coexistence of more than one form of a gene
or portion thereof. A portion of a gene of which there are at least two different forms,
i.e., two different nucleotide sequences, is referred to as a "polymorphic region of a
gene". A polymorphic region can be a single nucleotide, the identity of which differs
in different alleles. A polymorphic region can also be several nucleotides long.

A "polymorphic gene" refers to a gene having at least one polymorphic region. 

To describe a "polymorphic site" in a nucleotide sequence, an "ambiguity code" is
often used that stands for the possible variations of nucleotides at one site.
The list of ambiguity codes is summarized in the following table:

Ambiguity Codes
(IUPAC Nomenclature)

B    c/g/t
D    a/g/t
H    a/c/t
K    g/t
M    a/c
N    a/c/g/t
R    a/g
S    c/g
V    a/c/g
W    a/t
Y    c/t

So, for example, an "R" in a nucleotide sequence means that either an "a" or a "g"
could be at that position.
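
The ambiguity table can be applied programmatically. The following Python sketch
stores the codes listed above (plus the four unambiguous bases) in a dictionary and
offers two hypothetical helper functions: one tests whether a concrete sequence is
compatible with a pattern containing ambiguity codes, and one enumerates every
concrete sequence a pattern stands for. The names are illustrative only.

```python
from itertools import product

# IUPAC nucleotide codes as given in the table, plus the plain bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "B": "CGT", "D": "AGT", "H": "ACT", "K": "GT", "M": "AC",
    "N": "ACGT", "R": "AG", "S": "CG", "V": "ACG", "W": "AT", "Y": "CT",
}


def matches(pattern: str, sequence: str) -> bool:
    """True if every base of sequence is allowed by the code at that position."""
    if len(pattern) != len(sequence):
        return False
    return all(
        base in IUPAC.get(code, "")
        for code, base in zip(pattern.upper(), sequence.upper())
    )


def expand(pattern: str) -> list:
    """List all concrete sequences represented by an ambiguous pattern."""
    choices = (IUPAC[code] for code in pattern.upper())
    return ["".join(combo) for combo in product(*choices)]


if __name__ == "__main__":
    print(matches("ACRT", "ACGT"), matches("ACRT", "ACCT"))
    print(expand("AR"))
```

Note that `expand` grows exponentially with the number of ambiguous positions, so
it is only practical for short patterns such as a single polymorphic site with
flanking bases.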

The terms "protein", "polypeptide" and "peptide" are used interchangeably herein
when referring to a gene product.

A "regulatory element", also termed herein "regulatory sequence", is intended to
include elements which are capable of modulating transcription from a basic
promoter and include elements such as enhancers and silencers. The term "enhancer",
also referred to herein as "enhancer element", is intended to include regulatory
elements capable of increasing, stimulating, or enhancing transcription from a basic
promoter. The term "silencer", also referred to herein as "silencer element", is
intended to include regulatory elements capable of decreasing, inhibiting, or
repressing transcription from a basic promoter. Regulatory elements are typically
present in 5' flanking regions of genes. However, regulatory elements have also been
shown to be present in other regions of a gene, in particular in introns. Thus, it is
possible that genes have regulatory elements located in introns, exons, coding
regions, and 3' flanking sequences. Such regulatory elements are also intended to be
encompassed by the present invention and can be identified by any of the assays that
can be used to identify regulatory elements in 5' flanking regions of genes.

The term "regulatory element" further encompasses "tissue specific" regulatory
elements, i.e., regulatory elements which effect expression of the selected DNA
sequence preferentially in specific cells (e.g., cells of a specific tissue). Gene
expression occurs preferentially in a specific cell if expression in this cell type is
significantly higher than expression in other cell types. The term "regulatory
element" also encompasses non-tissue specific regulatory elements, i.e., regulatory
elements which are active in most cell types. Furthermore, a regulatory element can
be a constitutive regulatory element, i.e., a regulatory element which constitutively
regulates transcription, as opposed to a regulatory element which is inducible, i.e., a
regulatory element which is active primarily in response to a stimulus. A stimulus
can be, e.g., a molecule, such as a hormone, cytokine, heavy metal, phorbol ester,
cyclic AMP (cAMP), or retinoic acid.

Regulatory elements are typically bound by proteins, e.g., transcription factors. The
term "transcription factor" is intended to include proteins or modified forms thereof,
which interact preferentially with specific nucleic acid sequences, i.e., regulatory
elements, and which in appropriate conditions stimulate or repress transcription.
Some transcription factors are active when they are in the form of a monomer.
Alternatively, other transcription factors are active in the form of a dimer consisting
of two identical proteins (homodimer) or different proteins (heterodimer). Modified
forms of transcription factors are intended to refer to transcription factors having a
post-translational modification, such as the attachment of a phosphate group. The
activity of a transcription factor is frequently modulated by a post-translational
modification. For example, certain transcription factors are active only if they are
phosphorylated on specific residues. Alternatively, transcription factors can be active
in the absence of phosphorylated residues and become inactivated by phosphorylation.
A list of known transcription factors and their DNA binding sites can be found, e.g.,
in public databases, e.g., TFMATRIX Transcription Factor Binding Site Profile database.

As used herein, the term "specifically hybridizes" or "specifically detects" refers to
the ability of a nucleic acid molecule of the invention to hybridize to at least
approximately 6, 12, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130 or 140
consecutive nucleotides of either strand of a gene.

The term "wild-type allele" refers to an allele of a gene which, when present in two
copies in a subject, results in a wild-type phenotype. There can be several different
wild-type alleles of a specific gene, since certain nucleotide changes in a gene may
not affect the phenotype of a subject having two copies of the gene with the
nucleotide changes.

"Adverse drug reaction" (ADR) as used herein refers to an appreciably harmful or
unpleasant reaction, resulting from an intervention related to the use of a medicinal
product, which predicts hazard from future administration and warrants prevention or
specific treatment, or alteration of the dosage regimen, or withdrawal of the product.
In its most severe form an ADR might lead to the death of an individual.

The term "Drug Response" is intended to mean any response that a patient exhibits
upon drug administration. Specifically, drug response includes beneficial, i.e. desired
drug effects, ADR or no detectable reaction at all. More specifically, the term drug
response could also have a qualitative meaning, i.e. it embraces low or high
beneficial effects, respectively, and mild or severe ADR, respectively. The term
"Statin Response" as used herein refers to drug response after statin administration.
An individual drug response includes also a good or bad metabolizing of the drug,
meaning that "bad metabolizers" accumulate the drug in the body and by this could
show side effects of the drug due to accumulative overdoses.

"Candidate gene" as used herein includes genes that can be assigned to either normal
cardiovascular function or to metabolic pathways that are related to onset and/or
progression of cardiovascular diseases.

With regard to drug response the term "candidate gene" includes genes that can be
assigned to distinct phenotypes regarding the patient's response to drug
administration. Those phenotypes may include patients who benefit from relatively
small amounts of a given drug (high responders) or patients who need relatively high
doses in order to obtain the same benefit (low responders). In addition those
phenotypes may include patients who can tolerate high doses of a medicament
without exhibiting ADR, or patients who suffer from ADR even after receiving only
low doses of a medicament.

As neither the development of cardiovascular diseases nor the patient's response to
drug administration is completely understood, the term "candidate gene" may also
comprise genes with presently unknown function.

"PA SNP" (phenotype associated SNP) refers to a polymorphic site which shows a
significant association with a patient's phenotype (healthy, diseased, low or high
responder, drug tolerant, ADR prone, etc.).

"PA gene" (phenotype associated gene) refers to a genomic locus harbouring a PA
SNP, irrespective of the actual function of this gene locus.

"PA gene polypeptide" refers to a polypeptide encoded at least in part by a PA gene.

The term "Haplotype" as used herein refers to a group of two or more SNPs that are
functionally and/or spatially linked, i.e. haplotypes define groups of SNPs that lie
inside genes belonging to identical (or related metabolic) pathways and/or lie on the
same chromosome. Haplotypes are expected to give better predictive/diagnostic
information than a single SNP.

The term "statin" is intended to embrace all inhibitors of the enzyme 3-hydroxy-3-
methylglutaryl coenzyme A (HMG-CoA) reductase. Statins specifically inhibit the
enzyme HMG-CoA reductase which catalyzes the rate limiting step in cholesterol
biosynthesis. Known statins are Atorvastatin, Cerivastatin, Fluvastatin, Lovastatin,
Pravastatin and Simvastatin.

Methods for Assessing Cardiovascular Status 

The present invention provides diagnostic methods for assessing cardiovascular
status in a human individual. Cardiovascular status as used herein refers to the
physiological status of an individual's cardiovascular system as reflected in one or
more markers or indicators. Status markers include without limitation clinical
measurements such as, e.g., blood pressure, electrocardiographic profile, and
differentiated blood flow analysis as well as measurements of LDL- and HDL-
Cholesterol levels, other lipids and other well established clinical parameters that are
standard in the art. Status markers according to the invention include diagnoses of
one or more cardiovascular syndromes, such as, e.g., hypertension, acute myocardial
infarction, silent myocardial infarction, stroke, and atherosclerosis. It will be
understood that a diagnosis of a cardiovascular syndrome made by a medical
practitioner encompasses clinical measurements and medical judgement. Status
markers according to the invention are assessed using conventional methods well
known in the art. Also included in the evaluation of cardiovascular status are
quantitative or qualitative changes in status markers with time, such as would be
used, e.g., in the determination of an individual's response to a particular therapeutic
regimen.

The methods are carried out by the steps of:

(i) determining the sequence of one or more polymorphic positions within one,
several or all of the genes listed in Examples or other genes mentioned in this
file in the individual to establish a polymorphic pattern for the individual; and

(ii) comparing the polymorphic pattern established in (i) with the polymorphic
patterns of humans exhibiting different markers of cardiovascular status. The
polymorphic pattern of the individual is, preferably, highly similar and, most
preferably, identical to the polymorphic pattern of individuals who exhibit
particular status markers, cardiovascular syndromes, and/or particular patterns
of response to therapeutic interventions. Polymorphic patterns may also
include polymorphic positions in other genes which are shown, in
combination with one or more polymorphic positions in the genes listed in the
Examples, to correlate with the presence of particular status markers. In one
embodiment, the method involves comparing an individual's polymorphic
pattern with polymorphic patterns of individuals who have been shown to
respond positively or negatively to a particular therapeutic regimen.
Therapeutic regimen as used herein refers to treatments aimed at the
elimination or amelioration of symptoms and events associated with cardiovascular
disease. Such treatments include without limitation one or more of alteration
in diet, lifestyle, and exercise regimen; invasive and noninvasive surgical
techniques such as atherectomy, angioplasty, and coronary bypass surgery;
and pharmaceutical interventions, such as administration of ACE inhibitors,
angiotensin II receptor antagonists, diuretics, alpha-adrenoreceptor
antagonists, cardiac glycosides, phosphodiesterase inhibitors, beta-
adrenoreceptor antagonists, calcium channel blockers, HMG-CoA reductase
inhibitors, imidazoline receptor blockers, endothelin receptor blockers,
organic nitrites, and modulators of protein function of genes listed in the
Examples. Interventions with pharmaceutical agents not yet known whose
activity correlates with particular polymorphic patterns associated with
cardiovascular disease are also encompassed. It is contemplated, for example,
that patients who are candidates for a particular therapeutic regimen will be
screened for polymorphic patterns that correlate with responsivity to that
particular regimen.
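
Steps (i) and (ii) can be illustrated by a minimal sketch. It assumes that a
polymorphic pattern is stored as a mapping from a polymorphic position to the
genotype observed there, and that similarity is measured as the fraction of shared
positions with identical genotypes. The position names, group labels and the
similarity measure are hypothetical and serve only as an illustration; the
specification does not prescribe a particular comparison metric.

```python
from typing import Dict, List

# A polymorphic pattern: polymorphic position (hypothetical ID) -> observed genotype.
Pattern = Dict[str, str]


def pattern_similarity(individual: Pattern, reference: Pattern) -> float:
    """Fraction of positions typed in both patterns at which the genotypes agree."""
    shared = [pos for pos in individual if pos in reference]
    if not shared:
        return 0.0
    matches = sum(1 for pos in shared if individual[pos] == reference[pos])
    return matches / len(shared)


def best_matching_group(individual: Pattern, groups: Dict[str, List[Pattern]]) -> str:
    """Return the label of the reference group with the highest mean similarity.

    Assumes every group contains at least one reference pattern.
    """
    def mean_similarity(patterns: List[Pattern]) -> float:
        return sum(pattern_similarity(individual, p) for p in patterns) / len(patterns)

    return max(groups, key=lambda label: mean_similarity(groups[label]))


# Hypothetical example data: two SNP positions, two reference groups.
individual = {"snpA": "AG", "snpB": "CC"}
groups = {
    "responder": [{"snpA": "AG", "snpB": "CC"}],
    "non_responder": [{"snpA": "GG", "snpB": "CT"}],
}
print(best_matching_group(individual, groups))
```

A pattern that is identical to a reference pattern yields a similarity of 1.0,
which corresponds to the "most preferably, identical" case described in step (ii).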

In a preferred embodiment, the method involves comparing an individual's 
polymorphic pattern with polymorphic patterns of individuals who exhibit or have 
exhibited one or more markers of cardiovascular disease, such as, e.g., elevated LDL-
Cholesterol levels, high blood pressure, abnormal electrocardiographic profile,
myocardial infarction, stroke, or atherosclerosis. 

In another embodiment, the method involves comparing an individual's
polymorphic pattern with polymorphic patterns of individuals who exhibit or have
exhibited one or more drug related phenotypes, such as, e.g., low or high drug
response, or adverse drug reactions.

In practicing the methods of the invention, an individual's polymorphic pattern can
be established by obtaining DNA from the individual and determining the sequence
at predetermined polymorphic positions in the genes such as those described in this
file.

The DNA may be obtained from any cell source. Non-limiting examples of cell
sources available in clinical practice include blood cells, buccal cells, cervicovaginal
cells, epithelial cells from urine, fetal cells, or any cells present in tissue obtained by
biopsy. Cells may also be obtained from body fluids, including without limitation
blood, saliva, sweat, urine, cerebrospinal fluid, feces, and tissue exudates at the site
of infection or inflammation. DNA is extracted from the cell source or body fluid
using any of the numerous methods that are standard in the art. It will be understood
that the particular method used to extract DNA will depend on the nature of the
source.

Diagnostic and Prognostic Assays

The present invention provides methods for determining the molecular structure of at
least one polymorphic region of a gene, specific allelic variants of said polymorphic
region being associated with cardiovascular disease. In one embodiment, determining
the molecular structure of a polymorphic region of a gene comprises determining the
identity of the allelic variant. A polymorphic region of a gene, of which specific
alleles are associated with cardiovascular disease, can be located in an exon, an
intron, at an intron/exon border, or in the promoter of the gene.

The invention provides methods for determining whether a subject has, or is at risk
of developing, a cardiovascular disease. Such disorders can be associated with an
aberrant gene activity, e.g., abnormal binding to a form of a lipid, or an aberrant gene
protein level. An aberrant gene protein level can result from an aberrant transcription
or post-transcriptional regulation. Thus, allelic differences in specific regions of a
gene can result in differences of gene protein due to differences in regulation of
expression. In particular, some of the identified polymorphisms in the human gene
may be associated with differences in the level of transcription, RNA maturation,
splicing, or translation of the gene or transcription product.

In preferred embodiments, the methods of the invention can be characterized as
comprising detecting, in a sample of cells from the subject, the presence or absence
of a specific allelic variant of one or more polymorphic regions of a gene. The allelic
differences can be: (i) a difference in the identity of at least one nucleotide or (ii) a
difference in the number of nucleotides, which difference can be a single nucleotide
or several nucleotides.

A preferred detection method is allele specific hybridization using probes
overlapping the polymorphic site and having about 5, 10, 20, 25, or 30 nucleotides around
the polymorphic region. Examples of probes for detecting specific allelic variants of
the polymorphic region located in intron X are probes comprising a nucleotide
sequence set forth in any of SEQ ID NO. X. In a preferred embodiment of the
invention, several probes capable of hybridizing specifically to allelic variants are
attached to a solid phase support, e.g., a "chip". Oligonucleotides can be bound to a
solid support by a variety of processes, including lithography. For example a chip
can hold up to 250,000 oligonucleotides (GeneChip, Affymetrix). Mutation detection
analysis using these chips comprising oligonucleotides, also termed "DNA probe
arrays", is described e.g., in Cronin et al. (1996) Human Mutation 7:244 and in Kozal
et al. (1996) Nature Medicine 2:753. In one embodiment, a chip comprises all the
allelic variants of at least one polymorphic region of a gene. The solid phase support
is then contacted with a test nucleic acid and hybridization to the specific probes is
detected. Accordingly, the identity of numerous allelic variants of one or more genes
can be identified in a simple hybridization experiment. For example, the identity of
the allelic variant of the nucleotide polymorphism of nucleotide A or G at position 33
of SEQ ID 1 (baySNP179) and that of other possible polymorphic regions can be
determined in a single hybridization experiment.
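
The design of allele specific probes around a polymorphic site can be sketched as
follows. This is a minimal illustration only: the function name, the 0-based index
convention, the flank length and the example sequence are assumptions and are not
taken from the sequence listing. One probe is generated per allele, with the
polymorphic nucleotide placed centrally between equal flanks.

```python
def allele_specific_probes(sequence: str, snp_index: int, alleles: str, flank: int = 10) -> dict:
    """Return one probe per allele with the polymorphic base placed centrally.

    sequence  : reference sequence surrounding the polymorphic site (5'->3')
    snp_index : 0-based index of the polymorphic base within `sequence`
    alleles   : the alternative bases at that position, e.g. "AG"
    flank     : number of nucleotides kept on each side of the site
    """
    if not 0 <= snp_index < len(sequence):
        raise ValueError("snp_index lies outside the sequence")
    start = max(0, snp_index - flank)
    end = min(len(sequence), snp_index + flank + 1)
    left = sequence[start:snp_index]
    right = sequence[snp_index + 1:end]
    return {allele: left + allele + right for allele in alleles}


# Hypothetical 12-nucleotide sequence with an A/G polymorphism at index 5.
probes = allele_specific_probes("ACGTACGTACGT", snp_index=5, alleles="AG", flank=3)
print(probes)
```

On an array, each of the resulting probes would occupy its own position, so that
the hybridization signal at a position indicates which allele is present in the
test nucleic acid.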

In other detection methods, it is necessary to first amplify at least a portion of a gene
prior to identifying the allelic variant. Amplification can be performed, e.g., by PCR
and/or LCR, according to methods known in the art. In one embodiment, genomic
DNA of a cell is exposed to two PCR primers and amplification for a number of
cycles sufficient to produce the required amount of amplified DNA. In preferred
embodiments, the primers are located between 40 and 350 base pairs apart. Preferred
primers for amplifying gene fragments of genes of this file are listed in Table 2 in
the Examples.

Alternative amplification methods include: self sustained sequence replication
(Guatelli, J. C. et al., 1990, Proc. Natl. Acad. Sci. U.S.A. 87:1874-1878),
transcriptional amplification system (Kwoh, D. Y. et al., 1989, Proc. Natl. Acad. Sci. U.S.A.
86:1173-1177), Q-Beta Replicase (Lizardi, P. M. et al., 1988, Bio/Technology
6:1197), or any other nucleic acid amplification method, followed by the detection of
the amplified molecules using techniques well known to those of skill in the art.
These detection schemes are especially useful for the detection of nucleic acid
molecules if such molecules are present in very low numbers.

In one embodiment, any of a variety of sequencing reactions known in the art can be
used to directly sequence at least a portion of a gene and detect allelic variants, e.g.,
mutations, by comparing the sequence of the sample with the
corresponding wild-type (control) sequence. Exemplary sequencing reactions include
those based on techniques developed by Maxam and Gilbert (Proc. Natl. Acad. Sci.
USA (1977) 74:560) or Sanger (Sanger et al. (1977) Proc. Natl. Acad. Sci. 74:5463). It
is also contemplated that any of a variety of automated sequencing procedures may
be utilized when performing the subject assays (Biotechniques (1995) 19:448),
including sequencing by mass spectrometry (see, for example, U.S. Pat. No.
5,547,835 and international patent application Publication Number WO 94/16101,
entitled "DNA Sequencing by Mass Spectrometry" by H. Koster; U.S. Pat. No.
5,547,835 and international patent application Publication Number WO 94/21822
entitled "DNA Sequencing by Mass Spectrometry Via Exonuclease Degradation" by
H. Koster; and U.S. Pat. No. 5,605,798 and International Patent Application No.
PCT/US96/03651 entitled "DNA Diagnostics Based on Mass Spectrometry" by H.
Koster; Cohen et al. (1996) Adv Chromatogr 36:127-162; and Griffin et al. (1993)
Appl Biochem Biotechnol 38:147-159). It will be evident to one skilled in the art
that, for certain embodiments, the occurrence of only one, two or three of the nucleic
acid bases need be determined in the sequencing reaction. For instance, A-track or
the like, e.g., where only one nucleotide is detected, can be carried out.

Yet other sequencing methods are disclosed, e.g., in U.S. Pat. No. 5,580,732 entitled
"Method of DNA sequencing employing a mixed DNA-polymer chain probe" and
U.S. Pat. No. 5,571,676 entitled "Method for mismatch-directed in vitro DNA
sequencing".

In some cases, the presence of a specific allele of a gene in DNA from a subject can
be shown by restriction enzyme analysis. For example, a specific nucleotide
polymorphism can result in a nucleotide sequence comprising a restriction site which is
absent from the nucleotide sequence of another allelic variant.
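
Whether a given polymorphism creates or abolishes a restriction site can be checked
in silico before a restriction enzyme assay is set up. The following sketch assumes
a hypothetical sequence, a 0-based SNP index and the EcoRI recognition sequence
GAATTC; only the strand given is scanned, which is sufficient for palindromic
recognition sequences such as this one.

```python
def restriction_site_by_allele(sequence: str, snp_index: int, alleles: str, site: str) -> dict:
    """Map each allele to True if a recognition site overlaps the polymorphic base."""
    result = {}
    for allele in alleles:
        variant = sequence[:snp_index] + allele + sequence[snp_index + 1:]
        # Only windows that include the polymorphic base can differ between alleles.
        first = max(0, snp_index - len(site) + 1)
        last = min(snp_index, len(variant) - len(site))
        result[allele] = any(
            variant[i:i + len(site)] == site for i in range(first, last + 1)
        )
    return result


# Hypothetical example: the A allele completes an EcoRI site, the G allele destroys it.
sites = restriction_site_by_allele("TTGAATTCAA", snp_index=4, alleles="AG", site="GAATTC")
print(sites)
```

In this example only DNA carrying the A allele would be cleaved, so the two allelic
variants would yield different fragment patterns after digestion.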

In other embodiments, alterations in electrophoretic mobility are used to identify the
type of gene allelic variant. For example, single strand conformation polymorphism
(SSCP) may be used to detect differences in electrophoretic mobility between mutant
and wild type nucleic acids (Orita et al. (1989) Proc. Natl. Acad. Sci. USA 86:2766;
see also Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) Genet Anal Tech
Appl 9:73-79). Single-stranded DNA fragments of sample and control nucleic acids
are denatured and allowed to renature. The secondary structure of single-stranded
nucleic acids varies according to sequence; the resulting alteration in electrophoretic
mobility enables the detection of even a single base change. The DNA fragments
may be labeled or detected with labeled probes. The sensitivity of the assay may be
enhanced by using RNA (rather than DNA), in which the secondary structure is more
sensitive to a change in sequence. In another preferred embodiment, the subject
method utilizes heteroduplex analysis to separate double stranded heteroduplex
molecules on the basis of changes in electrophoretic mobility (Keen et al. (1991)
Trends Genet 7:5).

In yet another embodiment, the identity of an allelic variant of a polymorphic region
is obtained by analyzing the movement of a nucleic acid comprising the polymorphic
region in polyacrylamide gels containing a gradient of denaturant, assayed using
denaturing gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature
313:495). When DGGE is used as the method of analysis, DNA will be modified to
insure that it does not completely denature, for example by adding a GC clamp of
approximately 40 bp of high-melting GC-rich DNA by PCR. In a further
embodiment, a temperature gradient is used in place of a denaturing agent gradient to
identify differences in the mobility of control and sample DNA (Rosenbaum and
Reissner (1987) Biophys Chem 265:1275).

Examples of techniques for detecting differences of at least one nucleotide between 2
nucleic acids include, but are not limited to, selective oligonucleotide hybridization,
selective amplification, or selective primer extension. For example, oligonucleotide
probes may be prepared in which the known polymorphic nucleotide is placed
centrally (allele-specific probes) and then hybridized to target DNA under conditions
which permit hybridization only if a perfect match is found (Saiki et al. (1986)
Nature 324:163; Saiki et al. (1989) Proc. Natl. Acad. Sci. USA 86:6230; and Wallace
et al. (1979) Nucl. Acids Res. 6:3543). Such allele specific oligonucleotide
hybridization techniques may be used for the simultaneous detection of several nucleotide
changes in different polymorphic regions of a gene. For example, oligonucleotides
having nucleotide sequences of specific allelic variants are attached to a hybridizing
membrane and this membrane is then hybridized with labeled sample nucleic acid.
Analysis of the hybridization signal will then reveal the identity of the nucleotides of
the sample nucleic acid.

Alternatively, allele specific amplification technology which depends on selective
PCR amplification may be used. Oligonucleotides used as primers for specific
amplification may carry the allelic variant of interest in the center of the molecule (so
that amplification depends on differential hybridization) (Gibbs et al. (1989) Nucleic
Acids Res. 17:2437-2448) or at the extreme 3' end of one primer where, under
appropriate conditions, mismatch can prevent or reduce polymerase extension
(Prossner (1993) Tibtech 11:238; Newton et al. (1989) Nucl. Acids Res. 17:2503).
This technique is also termed "PROBE" for Probe Oligo Base Extension. In addition
it may be desirable to introduce a novel restriction site in the region of the mutation
to create cleavage-based detection (Gasparini et al. (1992) Mol. Cell Probes 6:1).

In another embodiment, identification of the allelic variant is carried out using an
oligonucleotide ligation assay (OLA), as described, e.g., in U.S. Pat. No. 4,998,617
and in Landegren, U. et al., Science 241:1077-1080 (1988). The OLA protocol uses
two oligonucleotides which are designed to be capable of hybridizing to abutting
sequences of a single strand of a target. One of the oligonucleotides is linked to a
separation marker, e.g., biotinylated, and the other is detectably labeled. If the precise
complementary sequence is found in a target molecule, the oligonucleotides will
hybridize such that their termini abut, and create a ligation substrate. Ligation then
permits the labeled oligonucleotide to be recovered using avidin, or another biotin
ligand. Nickerson, D. A. et al. have described a nucleic acid detection assay that
combines attributes of PCR and OLA (Nickerson, D. A. et al., Proc. Natl. Acad. Sci.
(U.S.A.) 87:8923-8927 (1990)). In this method, PCR is used to achieve the
exponential amplification of target DNA, which is then detected using OLA.

Several techniques based on this OLA method have been developed and can be used
to detect specific allelic variants of a polymorphic region of a gene. For example,
U.S. Pat. No. 5,593,826 discloses an OLA using an oligonucleotide having a 3'-amino
group and a 5'-phosphorylated oligonucleotide to form a conjugate having a
phosphoramidate linkage. In another variation of OLA described in Tobe et al.
((1996) Nucleic Acids Res 24:3728), OLA combined with PCR permits typing of
two alleles in a single microtiter well. By marking each of the allele-specific primers
with a unique hapten, i.e. digoxigenin and fluorescein, each OLA reaction can be
detected by using hapten specific antibodies that are labeled with different enzyme
reporters, alkaline phosphatase or horseradish peroxidase. This system permits the
detection of the two alleles using a high throughput format that leads to the
production of two different colors.

The invention further provides methods for detecting single nucleotide
polymorphisms in a gene. Because single nucleotide polymorphisms constitute sites of
variation flanked by regions of invariant sequence, their analysis requires no more
than the determination of the identity of the single nucleotide present at the site of
variation and it is unnecessary to determine a complete gene sequence for each
patient. Several methods have been developed to facilitate the analysis of such single
nucleotide polymorphisms.

In one embodiment, the single base polymorphism can be detected by using a
specialized exonuclease-resistant nucleotide, as disclosed, e.g., in Mundy, C. R. (U.S.
Pat. No. 4,656,127). According to the method, a primer complementary to the allelic
sequence immediately 3' to the polymorphic site is permitted to hybridize to a target
molecule obtained from a particular animal or human. If the polymorphic site on the
target molecule contains a nucleotide that is complementary to the particular
exonuclease-resistant nucleotide derivative present, then that derivative will be
incorporated onto the end of the hybridized primer. Such incorporation renders the
primer resistant to exonuclease, and thereby permits its detection. Since the identity
of the exonuclease-resistant derivative of the sample is known, a finding that the
primer has become resistant to exonucleases reveals that the nucleotide present in the
polymorphic site of the target molecule was complementary to that of the nucleotide
derivative used in the reaction. This method has the advantage that it does not require
the determination of large amounts of extraneous sequence data.

In another embodiment of the invention, a solution-based method is used for
determining the identity of the nucleotide of a polymorphic site (Cohen, D. et al.,
French Patent 2,650,840; PCT Appln. No. WO91/02087). As in the Mundy method
of U.S. Pat. No. 4,656,127, a primer is employed that is complementary to allelic
sequences immediately 3' to a polymorphic site. The method determines the identity
of the nucleotide of that site using labeled dideoxynucleotide derivatives, which, if
complementary to the nucleotide of the polymorphic site, will become incorporated
onto the terminus of the primer.
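
The logic of such single base primer extension can be sketched in a few lines. The
sketch assumes a hypothetical target strand written 5'->3' and a primer that anneals
to the sequence immediately 3' of the polymorphic site on that strand; the
terminator that is incorporated is then the complement of the target base lying
directly 5' of the primer binding site. All sequences and function names are
illustrative assumptions.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def incorporated_terminator(target: str, primer: str) -> str:
    """Return the labeled terminator base added to the 3' end of the primer.

    target : target strand, 5'->3', containing the polymorphic site
    primer : primer, 5'->3', complementary to the target sequence
             immediately 3' of the polymorphic site
    """
    binding_site = reverse_complement(primer)
    pos = target.find(binding_site)
    if pos <= 0:
        raise ValueError("primer does not anneal directly 3' of a target base")
    polymorphic_base = target[pos - 1]  # base immediately 5' of the binding site
    return COMPLEMENT[polymorphic_base]


# Hypothetical target strands differing only at the polymorphic site (A or G).
primer = "CCATGGAC"
print(incorporated_terminator("CCTAGAGTCCATGG", primer))
print(incorporated_terminator("CCTAGGGTCCATGG", primer))
```

Because each terminator carries a distinguishable label, the label detected on the
extended primer identifies the nucleotide present at the polymorphic site.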

An alternative method, known as Genetic Bit Analysis or GBA TM, is described by
Goelet, P. et al. (PCT Appln. No. 92/15712). The method of Goelet, P. et al. uses
mixtures of labeled terminators and a primer that is complementary to the sequence
3' to a polymorphic site. The labeled terminator that is incorporated is thus
determined by, and complementary to, the nucleotide present in the polymorphic site
of the target molecule being evaluated. In contrast to the method of Cohen et al.
(French Patent 2,650,840; PCT Appln. No. WO91/02087) the method of Goelet, P. et
al. is preferably a heterogeneous phase assay, in which the primer or the target
molecule is immobilized to a solid phase.

Recently, several primer-guided nucleotide incorporation procedures for assaying
polymorphic sites in DNA have been described (Kornher, J. S. et al., Nucl. Acids
Res. 17:7779-7784 (1989); Sokolov, B. P., Nucl. Acids Res. 18:3671 (1990);
Syvanen, A.-C., et al., Genomics 8:684-692 (1990); Kuppuswamy, M. N. et al.,
Proc. Natl. Acad. Sci. (U.S.A.) 88:1143-1147 (1991); Prezant, T. R. et al., Hum.
Mutat. 1:159-164 (1992); Ugozzoli, L. et al., GATA 9:107-112 (1992); Nyren, P. et
al., Anal. Biochem. 208:171-175 (1993)). These methods differ from GBA TM in
that they all rely on the incorporation of labeled deoxynucleotides to discriminate
between bases at a polymorphic site. In such a format, since the signal is proportional
to the number of deoxynucleotides incorporated, polymorphisms that occur in runs of
the same nucleotide can result in signals that are proportional to the length of the run
(Syvanen, A.-C., et al., Amer. J. Hum. Genet. 52:46-59 (1993)).

For determining the identity of the allelic variant of a polymorphic region located in
the coding region of a gene, yet other methods than those described above can be
used. For example, identification of an allelic variant which encodes a mutated gene
protein can be performed by using an antibody specifically recognizing the mutant
protein in, e.g., immunohistochemistry or immunoprecipitation. Antibodies to wild-
type gene protein are described, e.g., in Acton et al. (1996) Science 271:518 (anti-
mouse gene antibody cross-reactive with human gene). Other antibodies to wild-type
gene or mutated forms of gene proteins can be prepared according to methods known
in the art. Alternatively, one can also measure an activity of a gene protein, such as
binding to a lipid or lipoprotein. Binding assays are known in the art and involve,
e.g., obtaining cells from a subject, and performing binding experiments with a
labeled lipid, to determine whether binding to the mutated form of the receptor
differs from binding to the wild-type of the receptor.

If a polymorphic region is located in an exon, either in a coding or non-coding region
of the gene, the identity of the allelic variant can be determined by determining the
molecular structure of the mRNA, pre-mRNA, or cDNA. The molecular structure can
be determined using any of the above described methods for determining the
molecular structure of the genomic DNA, e.g., sequencing and SSCP.

The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits, such as those described above, comprising at least one 
probe or primer nucleic acid described herein, which may be conveniently used, e.g., 
to determine whether a subject has or is at risk of developing a disease associated 
with a specific gene allelic variant.

Sample nucleic acid for use in the above-described diagnostic and prognostic
methods can be obtained from any cell type or tissue of a subject. For example, a
subject's bodily fluid (e.g. blood) can be obtained by known techniques (e.g.
venipuncture) or from human tissues like heart (biopsies, transplanted organs).
Alternatively, nucleic acid tests can be performed on dry samples (e.g. hair or skin).
Fetal nucleic acid samples for prenatal diagnostics can be obtained from maternal
blood as described in International Patent Application No. WO91/07660 to Bianchi.
Alternatively, amniocytes or chorionic villi may be obtained for performing prenatal
testing.

Diagnostic procedures may also be performed in situ directly upon tissue sections
(fixed and/or frozen) of patient tissue obtained from biopsies or resections, such that
no nucleic acid purification is necessary. Nucleic acid reagents may be used as
probes and/or primers for such in situ procedures (see, for example, Nuovo, G. J.,
1992, PCR in situ hybridization: protocols and applications, Raven Press, New
York).

In addition to methods which focus primarily on the detection of one nucleic acid
sequence, profiles may also be assessed in such detection schemes. Fingerprint
profiles may be generated, for example, by utilizing a differential display procedure,
Northern analysis and/or RT-PCR.

In practicing the present invention, the distribution of polymorphic patterns in a large
number of individuals exhibiting particular markers of cardiovascular status or drug
response is determined by any of the methods described above, and compared with
the distribution of polymorphic patterns in patients that have been matched for age,
ethnic origin, and/or any other statistically or medically relevant parameters, who
exhibit quantitatively or qualitatively different status markers. Correlations are
achieved using any method known in the art, including nominal logistic regression,
chi square tests or standard least squares regression analysis. In this manner, it is
possible to establish statistically significant correlations between particular
polymorphic patterns and particular cardiovascular statuses (given in p values). It is
further possible to establish statistically significant correlations between particular
polymorphic patterns and changes in cardiovascular status or drug response such as
would result, e.g., from particular treatment regimens. In this manner, it is possible to
correlate polymorphic patterns with responsivity to particular treatments.
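
As an illustration of the chi square test named above, the following sketch computes
the Pearson chi square statistic and its p value for a 2x2 table of allele carriers
versus non-carriers in two phenotype groups. The counts are invented for
illustration, no continuity correction is applied, and the p value uses the
closed-form survival function of the chi square distribution with one degree of
freedom; this is an assumed minimal analysis, not the statistical procedure of the
Examples.

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int) -> tuple:
    """Pearson chi square test (1 degree of freedom, no continuity correction).

    Table layout:
                     carriers   non-carriers
        group 1         a            b
        group 2         c            d
    Returns (chi_square_statistic, p_value).
    """
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column must contain at least one observation")
    chi2 = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # For one degree of freedom, P(X >= chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Invented counts: 30 of 100 cases and 60 of 100 controls carry the allele.
chi2, p_value = chi_square_2x2(30, 70, 60, 40)
print(f"chi2 = {chi2:.2f}, p = {p_value:.2g}")
```

A small p value indicates that the allele distribution differs between the two
groups more than would be expected by chance; with many SNPs tested, a correction
for multiple testing would additionally be needed.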

In another embodiment of the present invention two or more polymorphic regions are
combined to define so called "haplotypes". Haplotypes are groups of two or more
SNPs that are functionally and/or spatially linked. It is possible to combine SNPs that
are disclosed in the present invention either with each other or with additional
polymorphic regions to form a haplotype. Haplotypes are expected to give better
predictive/diagnostic information than a single SNP.

In a preferred embodiment of the present invention a panel of SNPs/haplotypes is
defined that predicts the risk for CVD or drug response. This predictive panel is then
used for genotyping of patients on a platform that can genotype multiple SNPs at the
same time (Multiplexing). Preferred platforms are e.g. gene chips (Affymetrix) or the
Luminex LabMAP reader. The subsequent identification and evaluation of a patient's
haplotype can then help to guide specific and individualized therapy.

For example the present invention can identify patients exhibiting genetic
polymorphisms or haplotypes which indicate an increased risk for adverse drug
reactions. In that case the drug dose should be lowered in a way that the risk for ADR
is diminished. Also if the patient's response to drug administration is particularly
high (or the patient is badly metabolizing the drug), the drug dose should be lowered
to avoid the risk of ADR.

In turn if the patient's response to drug administration is low (or the patient is a
particularly high metabolizer of the drug), and there is no evident risk of ADR, the
drug dose should be raised to an efficacious level.

It is self evident that the ability to predict a patient's individual drug response should
affect the formulation of a drug, i.e. drug formulations should be tailored in a way
that they suit the different patient classes (low/high responder, poor/good
metabolizer, ADR prone patients). Those different drug formulations may encompass
different doses of the drug, i.e. the medicinal products contain low or high amounts
of the active substance. In another embodiment of the invention the drug
formulation may contain additional substances that facilitate the beneficial effects
and/or diminish the risk for ADR (Folkers et al. 1991, US Pat. 5,316,765).

Isolated Polymorphic Nucleic Acids, Probes, and Vectors 

The present invention provides isolated nucleic acids comprising the polymorphic 
positions described herein for human genes; vectors comprising the nucleic acids; 
and transformed host cells comprising the vectors. The invention also provides 
probes which are useful for detecting these polymorphisms. 

In practicing the present invention, many conventional techniques in molecular 
biology, microbiology, and recombinant DNA are used. Such techniques are well 
known and are explained fully in, for example, Sambrook et al., 1989, Molecular 
Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York; DNA Cloning: A Practical Approach, 
Volumes I and II, 1985 (D. N. Glover ed.); Oligonucleotide Synthesis, 1984, (M. 
J. Gait ed.); Nucleic Acid Hybridization, 1985, (Hames and Higgins); Ausubel et al., 
Current Protocols in Molecular Biology, 1997, (John Wiley and Sons); and Methods 
in Enzymology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., 
respectively). 

Insertion of nucleic acids (typically DNAs) comprising the sequences in a functional 
surrounding like full length cDNA of the present invention into a vector is easily 
accomplished when the termini of both the DNAs and the vector comprise 
compatible restriction sites. If this cannot be done, it may be necessary to modify the 
termini of the DNAs and/or vector by digesting back single-stranded DNA overhangs 
generated by restriction endonuclease cleavage to produce blunt ends, or to achieve 
the same result by filling in the single-stranded termini with an appropriate DNA 
polymerase. 

Alternatively, any site desired may be produced, e.g., by ligating nucleotide 
sequences (linkers) onto the termini. Such linkers may comprise specific 
oligonucleotide sequences that define desired restriction sites. Restriction sites can 
also be generated by the use of the polymerase chain reaction (PCR). See, e.g., Saiki 
et al., 1988, Science 239:48. The cleaved vector and the DNA fragments may also be 
modified if required by homopolymeric tailing. 
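
Generating a restriction site by PCR usually amounts to placing the recognition 
sequence at the 5' end of a primer. The following Python sketch illustrates this 
under stated assumptions: the primer sequence is hypothetical, the short 5' "clamp" 
of extra bases is a common laboratory convention rather than something taken from 
this text, and only three well-known recognition sequences are listed. 

```python
# Recognition sequences of three commonly used restriction enzymes.
RESTRICTION_SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
}


def add_restriction_site(primer: str, enzyme: str, clamp: str = "GC") -> str:
    """Return clamp + recognition site + gene-specific primer (5' to 3')."""
    site = RESTRICTION_SITES[enzyme]
    return clamp.upper() + site + primer.upper()


# Hypothetical gene-specific primer, for illustration only.
tailed_primer = add_restriction_site("atgaccatgattacg", "EcoRI")
print(tailed_primer)  # GCGAATTCATGACCATGATTACG
```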

The nucleic acids may be isolated directly from cells or may be chemically 
synthesized using known methods. Alternatively, the polymerase chain reaction 
(PCR) method can be used to produce the nucleic acids of the invention, using either 
chemically synthesized strands or genomic material as templates. Primers used for 
PCR can be synthesized using the sequence information provided herein and can 
further be designed to introduce appropriate new restriction sites, if desirable, to 
facilitate incorporation into a given vector for recombinant expression. 

The nucleic acids of the present invention may be flanked by native gene sequences, 
or may be associated with heterologous sequences, including promoters, enhancers, 
response elements, signal sequences, polyadenylation sequences, introns, 5'- and 3'- 
noncoding regions, and the like. The nucleic acids may also be modified by many 
means known in the art. Non-limiting examples of such modifications include 
methylation, "caps", substitution of one or more of the naturally occurring 
nucleotides with an analog, internucleotide modifications such as, for example, those 
with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, 
phosphoroamidates, carbamates, morpholines etc.) and with charged linkages (e.g., 
phosphorothioates, phosphorodithioates, etc.). Nucleic acids may contain one or 
more additional covalently linked moieties, such as, for example, proteins (e.g., 
nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., 
acridine, psoralen, etc.), chelators (e.g., metals, radioactive metals, iron, oxidative 
metals, etc.), and alkylators. PNAs are also included. The nucleic acid may be 
derivatized by formation of a methyl or ethyl phosphotriester or an alkyl 
phosphoramidate linkage. Furthermore, the nucleic acid sequences of the present 
invention may also be modified with a label capable of providing a detectable signal, 
either directly or indirectly. Exemplary labels include radioisotopes, fluorescent 
molecules, biotin, and the like. 

The invention also provides nucleic acid vectors comprising the gene sequences or 
derivatives or fragments thereof of genes described in the Examples. A large number 
of vectors, including plasmid and fungal vectors, have been described for replication 
and/or expression in a variety of eukaryotic and prokaryotic hosts, and may be used 
for gene therapy as well as for simple cloning or protein expression. Non-limiting 
examples of suitable vectors include pUC plasmids, pET plasmids 
(Novagen, Inc., Madison, Wis.), or pRSET or pREP (Invitrogen, San Diego, Calif.), 
and many appropriate host cells, using methods disclosed or cited herein or otherwise 
known to those skilled in the relevant art. The particular choice of vector/host is not 
critical to the practice of the invention. 




Suitable host cells may be transformed/transfected/infected as appropriate by any 
suitable method including electroporation, CaCl2 mediated DNA uptake, fungal or 
viral infection, microinjection, microprojectile, or other established methods. 
Appropriate host cells include bacteria, archaebacteria, fungi, especially yeast, and 
plant and animal cells, especially mammalian cells. A large number of transcription 
initiation and termination regulatory regions have been isolated and shown to be 
effective in the transcription and translation of heterologous proteins in the various 
hosts. Examples of these regions, methods of isolation, manner of manipulation, etc. 
are known in the art. Under appropriate expression conditions, host cells can be used 
as a source of recombinantly produced peptides and polypeptides encoded by genes 
of the Examples. Nucleic acids encoding peptides or polypeptides from gene 
sequences of the Examples may also be introduced into cells by recombination 
events. For example, such a sequence can be introduced into a cell and thereby effect 
homologous recombination at the site of an endogenous gene or a sequence with 
substantial identity to the gene. Other recombination-based methods such as 
non-homologous recombinations or deletion of endogenous genes by homologous 
recombination may also be used. 

In case of proteins that form heterodimers or other multimers, both or all subunits 
have to be expressed in one system or cell. 

The nucleic acids of the present invention find use as probes for the detection of 
genetic polymorphisms and as templates for the recombinant production of normal or 
variant peptides or polypeptides encoded by genes listed in the Examples. 

Probes in accordance with the present invention comprise without limitation isolated 
nucleic acids of about 10-100 bp, preferably 15-75 bp and most preferably 17-25 bp 
in length, which hybridize at high stringency to one or more of the polymorphic 
sequences disclosed herein or to a sequence immediately adjacent to a polymorphic 
position. Furthermore, in some embodiments a full-length gene sequence may be 
used as a probe. In one series of embodiments, the probes span the polymorphic 
positions in genes disclosed herein. In another series of embodiments, the probes 
correspond to sequences immediately adjacent to the polymorphic positions. 
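
The two probe layouts named above (spanning the polymorphic position, or lying 
immediately adjacent to it) can be sketched as simple substring selections. The 
function names, the 0-based indexing and the test sequence in the following Python 
sketch are hypothetical; the default length of 21 is merely one value inside the 
most preferred 17-25 bp range, and hybridization properties are not modelled. 

```python
def spanning_probe(seq: str, snp_index: int, length: int = 21) -> str:
    """Probe of `length` bases centred on the polymorphic position (0-based)."""
    start = snp_index - length // 2
    if start < 0 or start + length > len(seq):
        raise ValueError("sequence too short for a probe centred on the SNP")
    return seq[start:start + length]


def adjacent_probe(seq: str, snp_index: int, length: int = 21) -> str:
    """Probe ending immediately 5' of the polymorphic position (SNP excluded)."""
    start = snp_index - length
    if start < 0:
        raise ValueError("sequence too short for a probe adjacent to the SNP")
    return seq[start:snp_index]


# Artificial sequence with the polymorphic base ("C") at index 20.
sequence = "A" * 20 + "C" + "G" * 20
print(spanning_probe(sequence, 20))      # 10 A, the C, then 10 G
print(adjacent_probe(sequence, 20, 17))  # 17 A, stops just before the C
```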

Polymorphic Polypeptides and Polymorphism-Specific Antibodies 

The present invention encompasses isolated peptides and polypeptides encoded by 
genes listed in the Examples comprising polymorphic positions disclosed herein. In 
one preferred embodiment, the peptides and polypeptides are useful screening targets 
to identify cardiovascular drugs. In another preferred embodiment, the peptides and 
polypeptides are capable of eliciting antibodies in a suitable host animal that react 
specifically with a polypeptide comprising the polymorphic position and distinguish 
it from other polypeptides having a different sequence at that position. 

Polypeptides according to the invention are preferably at least five or more residues 
in length, preferably at least fifteen residues. Methods for obtaining these 
polypeptides are described below. Many conventional techniques in protein biochemistry 
and immunology are used. Such techniques are well known and are explained in 
Immunochemical Methods in Cell and Molecular Biology, 1987 (Mayer and Walker, 
eds; Academic Press, London); Scopes, 1987, Protein Purification: Principles and 
Practice, Second Edition (Springer-Verlag, N.Y.) and Handbook of Experimental 
Immunology, 1986, Volumes I-IV (Weir and Blackwell eds.). 

Nucleic acids comprising protein-coding sequences can be used to direct the ITT 
recombinant expression of polypeptides encoded by genes disclosed herein in intact 
cells or in cell-free translation systems. The known genetic code, tailored if desired 
for more efficient expression in a given host organism, can be used to synthesize 
oligonucleotides encoding the desired amino acid sequences. The polypeptides may 
be isolated from human cells, or from heterologous organisms or cells (including, but 
not limited to, bacteria, fungi, insect, plant, and mammalian cells) into which an 
appropriate protein-coding sequence has been introduced and expressed. 
Furthermore, the polypeptides may be part of recombinant fusion proteins. 
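
Tailoring the genetic code to a host amounts to choosing, for every amino acid, one 
of its synonymous codons. The following Python sketch shows the idea for a handful 
of residues. The preference table is an illustrative assumption (each entry is a 
valid codon for its residue in the standard genetic code, but the choice is not 
derived from measured codon-usage data), and the peptide is hypothetical. 

```python
# Illustrative codon preferences: one valid standard-code codon per residue.
# A real application would use a host-specific codon-usage table instead.
PREFERRED_CODON = {
    "M": "ATG",  # methionine
    "K": "AAA",  # lysine
    "L": "CTG",  # leucine
    "E": "GAA",  # glutamate
    "A": "GCG",  # alanine
    "G": "GGC",  # glycine
    "W": "TGG",  # tryptophan
}


def back_translate(peptide: str, table=PREFERRED_CODON) -> str:
    """Return a DNA sequence encoding `peptide` using the preferred codons."""
    try:
        return "".join(table[residue] for residue in peptide.upper())
    except KeyError as missing:
        raise ValueError(f"no codon listed for residue {missing}") from None


print(back_translate("MKLEA"))  # ATGAAACTGGAAGCG
```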

Peptides and polypeptides may be chemically synthesized by commercially available 
automated procedures, including, without limitation, exclusive solid phase synthesis, 
partial solid phase methods, fragment condensation or classical solution synthesis. 
The polypeptides are preferably prepared by solid phase peptide synthesis as 
described by Merrifield, 1963, J. Am. Chem. Soc. 85:2149. 

Methods for polypeptide purification are well-known in the art, including, without 
limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, 
reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, and 
countercurrent distribution. For some purposes, it is preferable to produce the 
polypeptide in a recombinant system in which the protein contains an additional 
sequence tag that facilitates purification, such as, but not limited to, a polyhistidine 
sequence. The polypeptide can then be purified from a crude lysate of the host cell by 
chromatography on an appropriate solid-phase matrix. Alternatively, antibodies 
produced against peptides encoded by genes disclosed herein can be used as 
purification reagents. Other purification methods are possible. 

The present invention also encompasses derivatives and homologues of the 
polypeptides. For some purposes, nucleic acid sequences encoding the peptides may be 
altered by substitutions, additions, or deletions that provide for functionally 
equivalent molecules, i.e., function-conservative variants. For example, one or more 
amino acid residues within the sequence can be substituted by another amino acid of 
similar properties, such as, for example, positively charged amino acids (arginine, 
lysine, and histidine); negatively charged amino acids (aspartate and glutamate); 
polar neutral amino acids; and non-polar amino acids. 
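
A substitution is function-conservative in the above sense when both residues fall 
into the same property class. The following Python sketch makes this check explicit. 
The charged classes follow the text; the membership of the polar neutral and 
non-polar classes is not enumerated in the text and is an assumption here (textbook 
groupings differ, e.g. for glycine, cysteine and tyrosine). 

```python
RESIDUE_CLASSES = {
    "positively charged": set("RKH"),   # arginine, lysine, histidine
    "negatively charged": set("DE"),    # aspartate, glutamate
    "polar neutral": set("STNQCY"),     # assumed membership
    "non-polar": set("AVLIMFWPG"),      # assumed membership
}


def residue_class(residue: str) -> str:
    """Return the property class of a one-letter amino acid code."""
    for name, members in RESIDUE_CLASSES.items():
        if residue.upper() in members:
            return name
    raise ValueError(f"unknown residue: {residue!r}")


def is_conservative(original: str, substitute: str) -> bool:
    """True if both residues belong to the same property class."""
    return residue_class(original) == residue_class(substitute)


print(is_conservative("K", "R"))  # True: both positively charged
print(is_conservative("D", "K"))  # False: opposite charges
```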

The isolated polypeptides may be modified by, for example, phosphorylation, 
sulfation, acylation, or other protein modifications. They may also be modified with 
a label capable of providing a detectable signal, either directly or indirectly, 
including, but not limited to, radioisotopes and fluorescent compounds. 

The present invention also encompasses antibodies that specifically recognize the 
polymorphic positions of the invention and distinguish a peptide or polypeptide 
containing a particular polymorphism from one that contains a different sequence at 
that position. Such polymorphic position-specific antibodies according to the present 
invention include polyclonal and monoclonal antibodies. The antibodies may be 
elicited in an animal host by immunization with peptides encoded by genes disclosed 
herein or may be formed by in vitro immunization of immune cells. The 
immunogenic components used to elicit the antibodies may be isolated from human cells or 
produced in recombinant systems. The antibodies may also be produced in 
recombinant systems programmed with appropriate antibody-encoding DNA. 
Alternatively, the antibodies may be constructed by biochemical reconstitution of 
purified heavy and light chains. The antibodies include hybrid antibodies (i.e., 
containing two sets of heavy chain/light chain combinations, each of which 
recognizes a different antigen), chimeric antibodies (i.e., in which either the heavy 
chains, light chains, or both, are fusion proteins), and univalent antibodies (i.e., 
comprised of a heavy chain/light chain complex bound to the constant region of a 
second heavy chain). Also included are Fab fragments, including Fab' and 
F(ab')2 fragments of antibodies. Methods for the production of all of the above 
types of antibodies and derivatives are well-known in the art and are discussed in 
more detail below. For example, techniques for producing and processing polyclonal 
antisera are disclosed in Mayer and Walker, 1987, Immunochemical Methods in Cell 
and Molecular Biology, (Academic Press, London). The general methodology for 
making monoclonal antibodies by hybridomas is well known. Immortal 
antibody-producing cell lines can be created by cell fusion, and also by other techniques such 
as direct transformation of B lymphocytes with oncogenic DNA, or transfection with 
Epstein-Barr virus. See, e.g., Schreier et al., 1980, Hybridoma Techniques; U.S. Pat. 
Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632; 
and 4,493,890. Panels of monoclonal antibodies produced against peptides encoded 
by genes disclosed herein can be screened for various properties; i.e. for isotype, 
epitope affinity, etc. 

The antibodies of this invention can be purified by standard methods, including but 
not limited to preparative disc-gel electrophoresis, isoelectric focusing, HPLC, 
reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, and 
countercurrent distribution. Purification methods for antibodies are disclosed, e.g., in 
The Art of Antibody Purification, 1989, Amicon Division, W. R. Grace & Co. 
General protein purification methods are described in Protein Purification: Principles 
and Practice, R. K. Scopes, Ed., 1987, Springer-Verlag, New York, N.Y. 

Methods for determining the immunogenic capability of the disclosed sequences and 
the characteristics of the resulting sequence-specific antibodies and immune cells are 
well-known in the art. For example, antibodies elicited in response to a peptide 
comprising a particular polymorphic sequence can be tested for their ability to 
specifically recognize that polymorphic sequence, i.e., to bind differentially to a 
peptide or polypeptide comprising the polymorphic sequence and thus distinguish it 
from a similar peptide or polypeptide containing a different sequence at the same 
position. 

Kits 

As set forth herein, the invention provides diagnostic methods, e.g., for determining 
the identity of the allelic variants of polymorphic regions present in the gene loci of 
genes disclosed herein, wherein specific allelic variants of the polymorphic region 
are associated with cardiovascular diseases. In a preferred embodiment, the 
diagnostic kit can be used to determine whether a subject is at risk of developing a 
cardiovascular disease. This information could then be used, e.g., to optimize 
treatment of such individuals. 

In preferred embodiments, the kit comprises a probe or primer which is capable of 
hybridizing to a gene and thereby identifying whether the gene contains an allelic 
variant of a polymorphic region which is associated with a risk for cardiovascular 
disease. The kit preferably further comprises instructions for use in diagnosing a 
subject as having, or having a predisposition towards developing, a cardiovascular 
disease. The probe or primers of the kit can be any of the probes or primers described 
in this file. 

Preferred kits for amplifying a region of a gene comprising a polymorphic region of 
interest comprise one, two or more primers. 

Antibody-based diagnostic methods and kits 

The invention also provides antibody-based methods for detecting polymorphic 
patterns in a biological sample. The methods comprise the steps of: (i) contacting a 
sample with one or more antibody preparations, wherein each of the antibody 
preparations is specific for a particular polymorphic form of the proteins encoded by 
genes disclosed herein, under conditions in which a stable antigen-antibody complex 
can form between the antibody and antigenic components in the sample; and (ii) 
detecting any antigen-antibody complex formed in step (i) using any suitable means 
known in the art, wherein the detection of a complex indicates the presence of the 
particular polymorphic form in the sample. 

Typically, immunoassays use either a labelled antibody or a labelled antigenic 
component (e.g., that competes with the antigen in the sample for binding to the 
antibody). Suitable labels include without limitation enzyme-based, fluorescent, 
chemiluminescent, radioactive, or dye molecules. Assays that amplify the signals 
from the probe are also known, such as, for example, those that utilize biotin and 
avidin, and enzyme-labelled immunoassays, such as ELISA assays. 

The present invention also provides kits suitable for antibody-based diagnostic 
applications. Diagnostic kits typically include one or more of the following 
components: 

(i) Polymorphism-specific antibodies. The antibodies may be pre-labelled; 
alternatively, the antibody may be unlabelled and the ingredients for labelling 
may be included in the kit in separate containers, or a secondary, labelled 
antibody is provided; and 

(ii) Reaction components: The kit may also contain other suitably packaged 
reagents and materials needed for the particular immunoassay protocol, 
including solid-phase matrices, if applicable, and standards. 

The kits referred to above may include instructions for conducting the test. 
Furthermore, in preferred embodiments, the diagnostic kits are adaptable to 
high-throughput and/or automated operation. 

Drug Targets and Screening Methods 

According to the present invention, nucleotide sequences derived from genes 
disclosed herein and peptide sequences encoded by genes disclosed herein, particularly 
those that contain one or more polymorphic sequences, comprise useful targets to 
identify cardiovascular drugs, i.e., compounds that are effective in treating one or 
more clinical symptoms of cardiovascular disease. Furthermore, especially when a 
protein is a multimeric protein built of two or more subunits, a 
combination of different polymorphic subunits is very useful. 

Drug targets include without limitation (i) isolated nucleic acids derived from the 
genes disclosed herein, and (ii) isolated peptides and polypeptides encoded by genes 
disclosed herein, each of which comprises one or more polymorphic positions. 

In vitro screening methods 

In one series of embodiments, an isolated nucleic acid comprising one or more 
polymorphic positions is tested in vitro for its ability to bind test compounds in a 
sequence-specific manner. The methods comprise: 

(i) providing a first nucleic acid containing a particular sequence at a polymorphic 
position and a second nucleic acid whose sequence is identical to that of the first 
nucleic acid except for a different sequence at the same polymorphic position; 

(ii) contacting the nucleic acids with a multiplicity of test compounds under 
conditions appropriate for binding; and 

(iii) identifying those compounds that bind selectively to either the first or second 
nucleic acid sequence. 

Selective binding as used herein refers to any measurable difference in any parameter 
of binding, such as, e.g., binding affinity, binding capacity, etc. 
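
Step (iii) can be sketched as a comparison of one binding parameter measured against 
both sequence variants. In the following Python sketch the parameter is a 
dissociation constant (lower means tighter binding), and a minimum fold difference 
stands in for "measurable difference". The compound names, the values and the 
two-fold threshold are hypothetical; the text itself sets no threshold. 

```python
def selective_binders(dissociation_constants, min_fold=2.0):
    """Return {compound: preferred variant} for selectively binding compounds.

    dissociation_constants maps each compound to a pair
    (Kd against variant 1, Kd against variant 2), in the same unit.
    """
    hits = {}
    for compound, (kd_variant1, kd_variant2) in dissociation_constants.items():
        fold = max(kd_variant1, kd_variant2) / min(kd_variant1, kd_variant2)
        if fold >= min_fold:
            # The variant with the lower Kd is bound more tightly.
            preferred = "variant 1" if kd_variant1 < kd_variant2 else "variant 2"
            hits[compound] = preferred
    return hits


# Hypothetical measurements (e.g. in nM), for illustration only.
measurements = {
    "compound_A": (10.0, 200.0),
    "compound_B": (50.0, 55.0),
    "compound_C": (300.0, 30.0),
}
print(selective_binders(measurements))
```

With these made-up numbers, compound_A and compound_C are reported as selective and 
compound_B is not. 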

In another series of embodiments, an isolated peptide or polypeptide comprising one 
or more polymorphic positions is tested in vitro for its ability to bind test compounds 
in a sequence-specific manner. The screening methods involve: 

(i) providing a first peptide or polypeptide containing a particular sequence at a 
polymorphic position and a second peptide or polypeptide whose sequence is 
identical to the first peptide or polypeptide except for a different sequence at 
the same polymorphic position; 

(ii) contacting the polypeptides with a multiplicity of test compounds under 
conditions appropriate for binding; and 

(iii) identifying those compounds that bind selectively to one of the nucleic acid 
sequences. 

In preferred embodiments, high-throughput screening protocols are used to survey a 
large number of test compounds for their ability to bind the genes or peptides 
disclosed above in a sequence-specific manner. 

Test compounds are screened from large libraries of synthetic or natural compounds. 
Numerous means are currently used for random and directed synthesis of saccharide, 
peptide, and nucleic acid based compounds. Synthetic compound libraries are 
commercially available from Maybridge Chemical Co. (Trevillet, Cornwall, UK), 
Comgenex (Princeton, N.J.), Brandon Associates (Merrimack, N.H.), and 
Microsource (New Milford, Conn.). A rare chemical library is available from Aldrich 
(Milwaukee, Wis.). Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available from e.g. Pan Laboratories 
(Bothell, Wash.) or MycoSearch (N.C.), or are readily producible. Additionally, 
natural and synthetically produced libraries and compounds are readily modified 
through conventional chemical, physical, and biochemical means. 

In vivo screening methods 

Intact cells or whole animals expressing polymorphic variants of genes disclosed 
herein can be used in screening methods to identify candidate cardiovascular drugs. 

In one series of embodiments, a permanent cell line is established from an individual 
exhibiting a particular polymorphic pattern. Alternatively, cells (including without 
limitation mammalian, insect, yeast, or bacterial cells) are programmed to express a 
gene comprising one or more polymorphic sequences by introduction of appropriate 
DNA. Identification of candidate compounds can be achieved using any suitable 
assay, including without limitation (i) assays that measure selective binding of test 
compounds to particular polymorphic variants of proteins encoded by genes 
disclosed herein; (ii) assays that measure the ability of a test compound to modify 
(i.e., inhibit or enhance) a measurable activity or function of proteins encoded by 
genes disclosed herein; and (iii) assays that measure the ability of a compound to 
modify (i.e., inhibit or enhance) the transcriptional activity of sequences derived 
from the promoter (i.e., regulatory) regions of genes disclosed herein. 

In another series of embodiments, transgenic animals are created in which (i) one or 
more human genes disclosed herein, having different sequences at particular 
polymorphic positions, are stably inserted into the genome of the transgenic animal; 
and/or (ii) the endogenous genes disclosed herein are inactivated and replaced with 
human genes disclosed herein, having different sequences at particular polymorphic 
positions. See, e.g., Coffman, Semin. Nephrol. 17:404, 1997; Esther et al., Lab. 
Invest. 74:953, 1996; Murakami et al., Blood Press. Suppl. 2:36, 1996. Such animals 
can be treated with candidate compounds and monitored for one or more clinical 
markers of cardiovascular status. 

The following are intended as non-limiting examples of the invention. 

Material and Methods 

Genotyping of patient DNA with the Pyrosequencing™ Method as described in the 
patent application WO 98 13523: 

First a PCR is set up to amplify the flanking regions around a SNP. For this, 2 ng of 
genomic DNA (patient sample) are mixed with a primer set (20 - 40 pmol) producing 
a 75 to 320 bp PCR fragment with 0.3 to 1 U Qiagen's Hot Star Taq Polymerase™ in 
a total volume of 20 µL. One primer is biotinylated depending on the direction of the 
sequencing primer. To force the biotinylated primer to be incorporated it is used at 
0.8 fold. 

For primer design, programs like Oligo 6™ (Molecular Biology Insights) or Primer 
Select™ (DNAStar) are used. PCR setup is performed by a BioRobot 3000™ from 
Qiagen. PCR takes place in T1 or Tgradient Thermocycler™ from Biometra. 

The whole PCR reaction is transferred into a PSQ plate™ (Pyrosequencing) and 
prepared using the Sample Prep Tool™ and SNP Reagent Kit™ from 
Pyrosequencing according to their instructions. 

Preparation of template for Pyrosequencing™: 

Sample preparation using PSQ 96 Sample Prep Tool: 

1. Mount the PSQ 96 Sample Prep Tool Cover onto the PSQ 96 Sample Prep 
Tool as follows: Place the cover on the desk, retract the 4 attachment rods by 
separating the handle from the magnetic rod holder, fit the magnetic rods into 
the holes of the cover plate, push the handle downward until a click is heard. 
The PSQ 96 Sample Prep Tool is now ready for use. 

2. To transfer beads from one plate to another, place the covered tool into the 
PSQ 96 Plate containing the samples and lower the magnetic rods by 
separating the handle from the magnetic rod holder. Move the tool up and 
down a few times then wait for 30-60 seconds. Transfer the beads into a new 
PSQ 96 plate containing the solution of choice. 

3. Release the beads by lifting the magnetic rod holder, bringing it together with 
the handle. Move the tool up and down a few times to make sure that the 
beads are released. 

All steps are performed at room temperature unless otherwise stated. 

Immobilization of PCR product: 

Biotinylated PCR products are immobilized on streptavidin-coated Dynabeads™ 
M-280 Streptavidin. Parallel immobilization of several samples is performed in the 
PSQ 96 Plate. 

1. Mix PCR product, 20 µl of a well optimized PCR, with 25 µl 2X BW-buffer 
II. Add 60-150 µg Dynabeads. It is also possible to add a mix of Dynabeads 
and 2X BW-buffer II to the PCR product yielding a final BW-buffer II 
concentration of approximately 1x. 

2. Incubate at 65°C for 15 min, agitating constantly to keep the beads dispersed. 
For optimal immobilization of fragments longer than 300 bp use 30 min 
incubation time. 
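
The statement that the buffer ends up at "approximately 1x" follows from a simple 
dilution calculation, sketched below in Python. The function name is hypothetical, 
and the volume contributed by the Dynabeads is ignored for simplicity. 

```python
def final_concentration(stock_fold, stock_volume_ul, sample_volume_ul):
    """Fold concentration of a stock buffer after mixing with a sample."""
    return stock_fold * stock_volume_ul / (stock_volume_ul + sample_volume_ul)


# 25 µl of 2X BW-buffer II mixed with 20 µl of PCR product.
print(round(final_concentration(2.0, 25.0, 20.0), 2))  # 1.11
```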

Strand separation: 

4. For strand separation, use the PSQ 96 Sample Prep Tool to transfer the beads 
with the immobilized sample to a PSQ 96 Plate containing 50 µl 0.50 M 
NaOH per well. Release the beads. 

5. After approximately 1 min, transfer the beads with the immobilized strand to 
a PSQ 96 Plate containing 99 µl 1x Annealing buffer per well and mix 
thoroughly. 

6. Transfer the beads to a PSQ 96 Plate containing 45 µl of a mix of 1x 
Annealing buffer and 3-15 pmoles sequencing primer per well. 

7. Heat at 80°C for 2 minutes in the PSQ 96 Sample Prep Thermoplate and 
move to room temperature. 

8. After reaching room temperature, continue with the sequencing reaction. 

Sequencing reaction: 

1. Choose the method to be used ("SNP Method") and enter relevant 
information in the PSQ 96 Instrument Control software. 

2. Place the cartridge and PSQ 96 Plate in the PSQ 96 Instrument. 

3. Start the run. 

Genotyping using the ABI 7700/7900 instrument (TaqMan) 

SNP genotyping using the TaqMan (Applied Biosystems/Perkin Elmer) was 
performed according to the manufacturer's instructions. The TaqMan assay is 
discussed by Lee et al., Nucleic Acids Research 1993, 21: 3761-3766. 

Genotyping with a service contractor: 

Qiagen Genomics, formerly Rapigene, is a service contractor for genotyping SNPs in 
patient samples. Their method is based on a primer extension method where two 
complementary primers are designed for each genotype that are labeled with different 
tags. Depending on the genotype only one primer will be elongated together with a 
certain tag. This tag can be detected with mass spectrometry and is a measure for the 
respective genotype. The method is described in the following patent: "Detection and 
identification of nucleic acid molecules - using tags which may be detected by 
non-fluorescent spectrometry or potentiometry" (WO 9727325). 

Examples 

To exemplify the present invention and its utility baySNP 28 will be used in the 
following: 

baySNP 28 is a C to T polymorphism and presumably resides in the gene of the 
human acidic 82 kDa protein (information taken from table 3). baySNP 28 was 
genotyped in various patient cohorts using the primers from table 2. As a result the 
following number of patients carrying different genotypes were found (information 
combined from tables 3 and 5a): 



baySNP   Cohort            Total   Genotype 11   Genotype 12   Genotype 22 
                                   "CC"          "CT"          "TT" 
28       HELD_FEM_HIRESP   12      1             2             9 
28       HELD_FEM_LORESP   22      3             12            7 



When comparing the number of female patients exhibiting a high response to statin 
therapy (HELD_FEM_HIRESP) with the control cohort (HELD_FEM_LORESP) it 
appears that the number of low responders carrying the CT genotype is increased. 
This points to a lower statin response among female individuals with the CT 
genotype. Applying statistical tests on those findings the following p-values were 
obtained (data taken from table 5b): 



BAYSNP   COMPARISON     GTYPE CPVAL   GTYPE XPVAL   GTYPE LRPVAL 
28       HELD_FEM_EFF   0.0506        0.0508        0.0442 
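
The column abbreviations are not defined at this point. Reading CPVAL as the p-value 
of a Pearson chi-square test and LRPVAL as that of a likelihood-ratio test on the 
2 x 3 table of genotype counts is an assumption; it is supported by the fact that 
these two tests, applied to the counts given above for baySNP 28, return values 
close to 0.0506 and 0.0442. The following Python sketch shows the calculation 
(XPVAL, presumably an exact test, is not reproduced). All function names are 
hypothetical. 

```python
import math


def expected_counts(table):
    """Expected cell counts if genotype and cohort were independent."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def pearson_chi2(table):
    """Pearson statistic: sum of (O - E)^2 / E over all cells."""
    expected = expected_counts(table)
    return sum((o - e) ** 2 / e
               for obs_row, exp_row in zip(table, expected)
               for o, e in zip(obs_row, exp_row))


def likelihood_ratio_g(table):
    """Likelihood-ratio statistic: 2 * sum of O * ln(O / E) over non-empty cells."""
    expected = expected_counts(table)
    return 2 * sum(o * math.log(o / e)
                   for obs_row, exp_row in zip(table, expected)
                   for o, e in zip(obs_row, exp_row) if o > 0)


def p_value_df2(statistic):
    """Upper-tail chi-square probability for 2 degrees of freedom.

    A 2 x 3 table has (2 - 1) * (3 - 1) = 2 degrees of freedom, for which
    the chi-square survival function has the closed form exp(-x / 2).
    """
    return math.exp(-statistic / 2)


# Rows: HELD_FEM_HIRESP, HELD_FEM_LORESP; columns: CC, CT, TT.
genotype_counts = [[1, 2, 9], [3, 12, 7]]
cpval = p_value_df2(pearson_chi2(genotype_counts))
lrpval = p_value_df2(likelihood_ratio_g(genotype_counts))
print(f"chi-square p = {cpval:.4f}, likelihood-ratio p = {lrpval:.4f}")
```

With cell counts as small as these, the chi-square approximation is rough, which 
may be why an additional test is tabulated alongside. 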



As at least one of the GTYPE p values is below 0.05 the association of genotype and 
statin response phenotype is regarded as statistically significant, i.e. the analysis of a 
patient's genotype can predict the response to statin therapy. In more detail one can 
calculate the relative risk to exhibit a certain statin response phenotype when 
carrying a certain genotype (data taken from table 6a): 

BAYSNP   COMPARISON     GTYPE1   GTYPE2   GTYPE3   RR1    RR2    RR3 
28       HELD_FEM_EFF   CC       CT       TT       0.68   0.29   3.38 



In case of baySNP 28 the risk to exhibit a high responder phenotype is 3.38 times 
higher when carrying the TT genotype. This indicates that a TT polymorphism in 
baySNP 28 is an independent risk factor for high statin response in females. On the 
other hand carriers of a CT or CC genotype have a reduced risk of being a high 
responder. 
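
The formula behind the relative risks is not spelled out here. One reading that 
agrees with the tabulated values to rounding is the fraction of high responders 
among carriers of a genotype divided by the fraction of high responders among all 
other patients. The following Python sketch applies this assumed definition to the 
genotype counts of baySNP 28; the function and variable names are hypothetical. 

```python
# Genotype counts for baySNP 28 (high and low responders, female cohort).
HIGH_RESPONDERS = {"CC": 1, "CT": 2, "TT": 9}
LOW_RESPONDERS = {"CC": 3, "CT": 12, "TT": 7}


def genotype_relative_risk(genotype, high=HIGH_RESPONDERS, low=LOW_RESPONDERS):
    """Risk of high response for carriers of `genotype` versus all others."""
    carriers_high = high[genotype]
    carriers_total = high[genotype] + low[genotype]
    others_high = sum(high.values()) - carriers_high
    others_total = sum(high.values()) + sum(low.values()) - carriers_total
    return (carriers_high / carriers_total) / (others_high / others_total)


for genotype in ("CC", "CT", "TT"):
    print(genotype, f"{genotype_relative_risk(genotype):.3f}")
```

For TT, for example, 9 of 16 carriers are high responders against 3 of 18 
non-carriers, giving (9/16) / (3/18) = 3.375. 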



In addition statistical associations can be calculated on the basis of alleles. This 
calculation would identify risk alleles instead of risk genotypes. 

In case of baySNP 28 the following allele counts were obtained (data combined from 
tables 3 and 5a): 



baySNP   Cohort            Total   Allele 1   Allele 2 
28       HELD_FEM_HIRESP   12      4          20 
28       HELD_FEM_LORESP   22      18         26 
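
The allele counts follow directly from the genotype counts, since every patient 
carries two alleles: a CC patient contributes two C alleles, a TT patient two T 
alleles, and a CT patient one of each. The following Python sketch (function name 
hypothetical) reproduces the counts above from the genotype counts given earlier 
for baySNP 28, taking allele 1 as C and allele 2 as T. 

```python
def allele_counts(cc, ct, tt):
    """Count C and T alleles in a cohort from its genotype counts."""
    return {"C": 2 * cc + ct, "T": 2 * tt + ct}


print(allele_counts(cc=1, ct=2, tt=9))    # {'C': 4, 'T': 20}  (HELD_FEM_HIRESP)
print(allele_counts(cc=3, ct=12, tt=7))   # {'C': 18, 'T': 26} (HELD_FEM_LORESP)
```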



When comparing the number of female patients with high statin response 
(HELD_FEM_HIRESP) with the control cohort (HELD_FEM_LORESP) it appears 
that the number of high responders carrying the T allele is increased, whereas the 
number of high responders carrying the C allele is diminished. This points to a higher 
statin response among female individuals with the T allele. Applying statistical tests 
on those findings the following p-values were obtained (data taken from table 5b): 



BAYSNP  COMPARISON    ALLELE CPVAL  ALLELE XPVAL  ALLELE LRPVAL
28      HELD_FEM_EFF  0,0411        0,0579        0,0349
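This section does not name the tests behind the CPVAL and LRPVAL columns. As a hedged illustration, the sketch below applies a Pearson chi-square test and a likelihood ratio test (both with one degree of freedom, no continuity correction) to the 2x2 allele table for baySNP 28. That these are the tests actually used is an assumption based only on the arithmetic; the function names are hypothetical, and the XPVAL column is not addressed.

```python
import math


def chi2_sf_df1(stat: float) -> float:
    """Upper tail probability of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(stat / 2.0))


def allele_association_pvalues(a: int, b: int, c: int, d: int) -> tuple[float, float]:
    """Pearson chi-square and likelihood ratio p-values for the table [[a, b], [c, d]].

    Rows are cohorts (high responders, low responders),
    columns are alleles (allele 1, allele 2).
    """
    observed = [[a, b], [c, d]]
    n = a + b + c + d
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    pearson = 0.0
    g_stat = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            obs = observed[i][j]
            pearson += (obs - expected) ** 2 / expected
            if obs > 0:  # a zero cell contributes nothing to the G statistic
                g_stat += 2.0 * obs * math.log(obs / expected)
    return chi2_sf_df1(pearson), chi2_sf_df1(g_stat)


# Allele counts for baySNP 28: high responders (C=4, T=20), low responders (C=18, T=26).
p_chi2, p_lr = allele_association_pvalues(4, 20, 18, 26)
print(f"{p_chi2:.3f} {p_lr:.3f}")
```

For these counts both values come out close to the tabulated 0,0411 and 0,0349, which supports the reading of the two columns but does not prove it.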

As at least one of the ALLELE p values is below 0,05 the association of allele and
statin response phenotype is regarded as statistically significant (in this example
significant p values were obtained from two statistical tests), i.e. the analysis of
a patient's alleles from baySNP 28 can also predict the extent of statin response. In more
detail one can calculate the relative risk to exhibit a certain statin response phenotype
when carrying a certain allele (data taken from table 6b):



baySNP  Allele 1  Allele 2  COMPARISON    RR1   RR2
28      C         T         HELD_FEM_EFF  0,42  2,39



In case of baySNP 28 the risk to exhibit a high responder phenotype is 2,39 times
higher when carrying the T allele. This indicates that the T allele of baySNP 28 is an
independent risk factor for a high statin response in females. In other words those
patients should receive lower doses of statins in order to avoid ADR. However due to
their 'high responder' phenotype they will still benefit from the drug. In turn carriers
of the C allele should receive higher drug doses in order to experience a beneficial
therapeutic effect.
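The relative risk values for baySNP 28 can be reproduced from the allele counts given earlier. The sketch below assumes that the relative risk of an allele is the fraction of high responders among all copies of that allele, divided by the same fraction for the other allele. This definition is not spelled out in this section; it is adopted here because it reproduces the tabulated values. The function name is hypothetical.

```python
def allele_relative_risks(case_a1: int, case_a2: int,
                          control_a1: int, control_a2: int) -> tuple[float, float]:
    """Return (RR1, RR2) for two alleles from case and control allele counts.

    The risk of an allele is the share of its copies found in the case cohort
    (here: high responders); RR1 = risk(allele 1) / risk(allele 2), RR2 is the inverse.
    """
    risk_a1 = case_a1 / (case_a1 + control_a1)
    risk_a2 = case_a2 / (case_a2 + control_a2)
    return risk_a1 / risk_a2, risk_a2 / risk_a1


# baySNP 28: high responders carry C=4, T=20; low responders carry C=18, T=26.
rr1, rr2 = allele_relative_risks(4, 20, 18, 26)
print(f"{rr1:.2f} {rr2:.2f}")  # 0.42 2.39
```

With only two alleles the two relative risks are reciprocals of each other, so a value above 1 for the T allele necessarily implies a value below 1 for the C allele.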

Another example is baySNP 29, which is taken to exemplify polymorphisms relevant
for adverse drug reactions. baySNP 29 was found significant when comparing male
patients with severe ADR to the respective controls (as defined in table 1b).

The relative risk ratios for the genotypes AA, AG and GG were as follows (data
taken from table 6a):



BAYSNP  COMPARISON        GTYPE1  GTYPE2  GTYPE3  RR1   RR2   RR3
29      HELD_MAL_ADR3ULN  AA      AG      GG      3.15  0.66  0.32



In this case male patients carrying the AA genotype have a 3,15 times higher risk to
suffer from ADR. In other words those patients should either receive lower doses of
statins or switch to an alternative therapy in order to avoid ADR. On the other hand
male patients with AG or GG genotype appear to be more resistant to ADR and
hence better tolerate statin therapy.

As can be seen from the following tables some of the associations that are disclosed
in the present invention are indicative for more than one phenotype. baySNP 1837 is
for example linked to ADR, but also to the risk to suffer from CVD (table 6).



Table 1a Definition of "good" and "bad" serum lipid levels

                         "Good"     "Bad"
LDL-Cholesterol [mg/dL]  125 - 150  170 - 200
Cholesterol [mg/dL]      190 - 240  265 - 315
HDL-Cholesterol [mg/dL]  60 - 105   30 - 55
Triglycerides [mg/dL]    45 - 115   170 - 450



According to the PROCAM algorithm (Assmann, G., Schulte, H., von Eckardstein,
A.; Am J. Cardiol 77 (1996); 1179-1184) it is possible to define other cohorts. For
example a lipid-based equation would calculate y as follows:

y = -0.0146*LDL+0.0418*HDL-0.3362*Id(TRIGLY) 

Good or bad cohorts could then be defined in the following way (FEM = female,
MAL = male):

FEM_GOOD  y >= -1.4
FEM_BAD   y <  -1.4
MAL_GOOD  y >= -1.7
MAL_BAD   y <  -1.7
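The cohort assignment above can be sketched in a few lines. Note that the logarithm in the equation is not legible in the source text (it reads "Id"), so its base is unknown here; the sketch therefore takes the logarithm as a parameter, and the natural logarithm used in the example call is purely an assumption. The function names and the example lipid values are hypothetical.

```python
import math
from typing import Callable

# Cut-offs for y as given in the cohort definitions above.
Y_CUTOFFS = {"FEM": -1.4, "MAL": -1.7}


def lipid_score(ldl: float, hdl: float, trigly: float,
                log_fn: Callable[[float], float]) -> float:
    """Lipid-based score y; lipid values in mg/dL, log_fn is the (assumed) logarithm."""
    return -0.0146 * ldl + 0.0418 * hdl - 0.3362 * log_fn(trigly)


def lipid_cohort(sex: str, y: float) -> str:
    """Return e.g. 'FEM_GOOD' or 'MAL_BAD' for sex 'FEM' or 'MAL'."""
    label = "GOOD" if y >= Y_CUTOFFS[sex] else "BAD"
    return f"{sex}_{label}"


# Hypothetical patient; natural logarithm chosen as an assumption.
y = lipid_score(ldl=130.0, hdl=60.0, trigly=100.0, log_fn=math.log)
print(lipid_cohort("FEM", y))  # FEM_GOOD
```

The choice of logarithm matters: with a base-2 logarithm the same hypothetical patient would fall below the female cut-off, so the base should be confirmed against the original publication before the score is used.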

Table 1b Definition of drug response phenotypes

Low responder
    Decrease of serum LDL of at least 10% and at most 50%
    upon administration of 0.8 mg Cerivastatin (female patients)

High responder
    Decrease of serum LDL of at least 50% upon administration
    of 0.4 mg Cerivastatin (female patients)

Very low responder
    Decrease of serum LDL of at least 10% and at most 35%
    upon administration of 0.8 mg Cerivastatin (female patients)

Very high responder
    Decrease of serum LDL of at least 55% upon administration
    of 0.4 mg Cerivastatin (female patients)

Ultra low responder
    Decrease of serum LDL of at least 10% and at most 25%
    upon administration of 0.8 mg Cerivastatin (female patients)

Ultra high responder
    Decrease of serum LDL of at least 60% upon administration
    of 0.4 mg Cerivastatin (female patients)

Tolerant patient
    No diagnosis of muscle cramps, muscle pain, muscle
    weakness, myalgia or myopathy
    AND
    serum CK levels below 70 mg/dl in women and below 80
    mg/dl in men.

ADR patient (CK increase at least 2xULN)
    Diagnosis of muscle cramps, muscle pain, muscle
    weakness, myalgia or myopathy
    OR
    serum CK levels higher than 140 mg/dl in women and 160
    mg/dl in men.

Advanced ADR patient [ADR3] (advanced CK increase, at least 3xULN)*
    Serum CK levels higher than 210 mg/dl in women and 240
    mg/dl in men

Severe ADR patient [ADR5] (severe CK increase, at least 5xULN)*
    Serum CK levels higher than 350 mg/dl in women and 400
    mg/dl in men



*: When assembling the cohorts for advanced and severe ADR we focused on the CK
serum levels as those provide a more independent measure of statin related ADR.
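The tolerance and ADR definitions of table 1b can be expressed as a small classifier. This is a minimal sketch under stated assumptions: the categories in the table overlap (a patient meeting the ADR5 criterion also meets ADR3 and ADR), so the sketch returns the most severe matching category, and it returns nothing for patients who fall between the "tolerant" and "ADR" cut-offs, a range the table does not cover. The function and constant names are hypothetical.

```python
from typing import Optional

# Serum CK cut-offs from table 1b, in the units used there:
# (tolerant below, ADR above, ADR3 above, ADR5 above)
CK_CUTOFFS = {"F": (70, 140, 210, 350), "M": (80, 160, 240, 400)}


def adr_category(sex: str, ck: float, muscle_symptoms: bool) -> Optional[str]:
    """Classify a patient as 'tolerant', 'ADR', 'ADR3' or 'ADR5'.

    sex: 'F' or 'M'; ck: serum CK level; muscle_symptoms: diagnosis of muscle
    cramps, muscle pain, muscle weakness, myalgia or myopathy.
    Returns None when no definition in table 1b applies.
    """
    tolerant_below, adr_above, adr3_above, adr5_above = CK_CUTOFFS[sex]
    # Advanced and severe ADR are defined on CK levels alone (see footnote).
    if ck > adr5_above:
        return "ADR5"
    if ck > adr3_above:
        return "ADR3"
    if ck > adr_above or muscle_symptoms:
        return "ADR"
    if ck < tolerant_below:
        return "tolerant"
    return None  # CK between the tolerant and ADR cut-offs, no symptoms


print(adr_category("M", 250, False))  # ADR3
```

Returning the most severe category is a design choice for illustration; when assembling cohorts one may instead want every category a patient qualifies for.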

Table 1c Definition of "high" and "low" serum HDL cholesterol levels

                                Male individuals  Female individuals
"High" HDL-Cholesterol [mg/dL]  >= 80             >= 104
"Low" HDL-Cholesterol [mg/dL]   <= 35





An informed consent was signed by the patients and control people. Blood was taken
by a physician according to medical standard procedures.

Samples were collected anonymously and labeled with a patient number.

DNA was extracted using kits from Qiagen.

Table 2a Oligonucleotide primers used for genotyping using mass spectrometry



The baySNP number refers to an internal numbering of the PA SNPs. Primer
sequences are listed for preamplification of the genomic fragments (primers EF and
ER) and for subsequent allele specific PCR of the SNP.



baySNP  SNP     Name  Sequence
28      C137T   CF    gggacggtcggtagatTCTAGAATTGTGCTTCCC
28      C137T   EF    TGTCCAGTGTTAGGAAAAA
28      C137T   ER    GACGATGCCTTCAGaiaWSATGTGGCTTCTC
28      C137T   TF    gctggctcggtcaagaTCTAGAATTGTGCTTCCT
29      A464G   AF    gggacggtcggtagatCATCGGTCAGTGTCCCCA
29      A464G   EF    GATGTCTGTCTCCTTGATGT
29      A464G   ER    GACGATGCCTTCAGCACAATGTGGGGGTTTTATT^
29      A464G   GF    gctggctcggtcaagaCATCGGTCAGTGTCCCCG
52      C397G   CR    gggacggtcggtagatTATTTTATAATGCAAAAG
52      C397G   EF    GACGATGCCTTCAGCACAGTGAATTGCC^
52      C397G   ER    TCTAAAGTGCTGGGATTG
52      C397G   GR    gctggctcggtcaagaTATTTTATAATGCAAAAC
56      A429G   AF    gggacggtcggtagatAAGGTCTTTGTACGTGTA
56      A429G   EF    CCAGGTACTGCCTTACAAA
56      A429G   ER    GACGATGCCTTCAGCACAGCTCCCAAAATAAATCACTC
56      A429G   GF    gctggctcggtcaagaAAGGTCTTTGTACGTGTG
89      A159G   AR    gggacggtcggtagatTGGAGTCGGGGGAGTCAT
89      A159G   EF    GACGATGCCTTCAGCACATAGTTCAAGGGTAAAGGA
89      A159G   ER    GAGGACGAGATGTAAGAG
89      A159G   GR    gctggctcggtcaagaTGGAGTCGGGGGAGTCAC
90      C154T   CF    gggacggtcggtagatCAGCGCATCCTGAACCAC
90      C154T   EF    GCTGGAACGAGTTCATCCT
90      C154T   ER    GACGATGCCTTCAGCACAGGACCCCACCITrCTTGT
90      C154T   TF    gctggctcggtcaagaCAGCGCATCCTGAACCAT
99      C58T    CR    gggacggtcggtagatTCCTGCTCTTTTCTCTAG
99      C58T    EF    GACGATGCCTTCAGCACACACTGACIXXriTACT^^
99      C58T    ER    TACTGTGTCTCAGCTCCA
99      C58T    TR    gctggctcggtcaagaTCCTGCTCTTTTCTCTAA
140     C468T   CR    gggacggtcggtagatGTGAATCCCAATACGAAG
140     C468T   EF    GACGATGCCTTCAGCACATAAAAAATAACCAGGTACTCCA
140     C468T   ER    GATGAGTCCTTCACCAAACATACA
140     C468T   TR    gctggctcggtcaagaGTGAATCCCAATACGAAA
152     A587G   AF    gggacggtcggtagatGGTGGGAGGTTCCAGCCA
152     A587G   EF    GCAGGAAGAAAGCTAGAA
152     A587G   ER    GACGATGCCTTCAGCACAAGGCAGGATAATGACAAC
152     A587G   GF    gctggctcggtcaagaGGTGGGAGGTTCCAGCCG
214     A209G   AF    gggacggtcggtagatCATTTCCACCTCACCAAA
214     A209G   EF    AGGTATTCCCGGCGTTTC
214     A209G   ER    GACGATGCCTTCAGCACATGTTGTGCGTCTGCTTCC
214     A209G   GF    gctggctcggtcaagaCATTTCCACCTCACCAAG
221     C339G   CF    gggacggtcggtagatTGTGAAGAACTGTTGCTC
221     C339G   EF    ^TTGAAGCTCATCTGCCTTCT
221     C339G   ER    GACGATGCCTTCAGCACATCCCCTTCCTTCTTACCT
221     C339G   GF    gctggctcggtcaagaTGTGAAGAACTGTTGCTG
224     C189T   CR    gggacggtcggtagatGCCCGCTTTTCTTCATCG
224     C189T   EF    GACGATGCCTTCAGCACACTGTCTTCAAGGGCTTACAC




224 


C189T 


£R 


TCCAACTTCAGGCAAAAC- 




224 


C189T 


TR 


gctggctcggtcaagaGCCCGClTrrCTTCATCA 




294 


C465T 


CR 


gggacggtcggtagatCCCAAGGCCAACAGGGAG 




294 


C465T 


EF 


GACGATGCCTTCAGCACAGCATTCTTATGCCAGTGTTC 




294 


C465T 


ER 


ATCCATCCCATCCTGTGT 




294 


C465T 


TR 


gctggctcggtcaagaCCCAAGGCCAACAGGQAA 




307 


C215T 


CR 


gggacggtcggtagatGAGTGGGTGCTGTTCCCG 




307 


C215T 


EF 


GACGATGCCTTCAGCACAGTTACTGCCTCTCTGACC 


307 


C215T 


ER 


AGTGTGACCTGCTCTCTT 




307 


C215T 


TR 


gctggctcggtcaagaGAGTGGGTGCTGTTCCCA 




411 


A369T 


ER 


gacgatgccttcagcacaAACACATTCCCCCTCTAC 




411 


A369T 


EF 


GTCTCTATTCCAAGCCAAG 




411 


A369T 


AF 


gggacggtcggtagatCCCCGCTCCAGCTCCTCA 




411- 


A369T 


TP 


gctggctcggtcaagaCCCCGCTCCAGCTCCTCT 


449 


C323G 


CR 


gggacggt cggt agatCCGCTTCTGCTTCTGCTG 


449 


C323G 


EF 


GACGATGCCTTCAGCACAAGGAGAAGAGGGAG6AGA 


449 


C323G 


ER 


GGA6CACGTAAGGAGAAA 


449 


03236 


GR 


gctggctcggtcaagaCOGCTTCPGCTTCTGCTC 


466 ( 


!:i23T 


CF 


gggacggtcggtagatGGCCAGGGGCTGGAGGGC 


466 ( 


:i23T 


EF 


TCTTCAGTTCTCTCAGCTTC 


466 ( 


:i23T 


ER 


GACGATGCCTTCAGCACATCACTAGGGGCTCTTACC 


466 ( 


:i23T 


TF 


gctggctcggtcaagaGGCCAGGGGCTGGAGGGT 


472 p 


U97G 


AR 


gggacggtcggtagatTCCTCCCGCTGCTTCAGT 


472 b 


U97G 


EF 


SAaaTGCCriX:AGCACATCACTTAaXATCAlACTT^ 


472 b 


1497G 


ER i 


WITCCTGCCTCCCACCTT . 


472 p 


L4976 


GR < 


jctggctcggtcaagaTCCTCCCGCTGCTTCAGC 


542 b 


L402G 


AR < 


jggacggtcggtagatACSAAATTCCCTCCCAACT 


542 p 


L402G 




aAOaATGCCTTCAGCACATGATTGAGCCAGTTGTTT 


542 U 
542 L 


402G 


ER C 


3GGGTGTATTTTGAGAGT6 




402G 


GR c 


JctggctcggtcaagaAGAAATTCCCTCCCAACC 


739 C87G 


CR c 


99acggtcggtagatGCTGGTTTGACTGGA0GG 


739 C87G 


EF C 


SACGATGCCTl'CAGCACAACCITGGTATAATOCTTTCC 


739 C87G 


ER I\ 


LG(iCAACCTAATCCACTT 


739 C87G 


GR s 


fCtggctcggtcaagaGCTGGTTTGACTGGACGC 


621 


A140C 


AF 


gggacggtcggtagatJ^TGCTGTGATACCTGGA 


821 


A140C 


CF 


gctggctcggtcaagaAGTGCTGTGATACCTGGC 


621 


A140C 


EF 


ACACCCACAAAACAAGAA 


821 


A140C 


£R 


GAOSATGCCTTCAGCACAGGAACAAGGACATAAWUSA^ 


1005 


A2576 


AR 


gggacggtcggtagatAGGAAATGTTAGCCCTGT 


1005 


A257G 


EF 


GACGATGCCTTCAGCACACTCCACTTCTCTATGCCTC 


1005 


A257G 


£R 


GTCCCCAGCTATGTATTGT 


1005 


A257G 


GR 


gctggctcggtcaagaAGGAAATGTTAGCCCTGC 


1055 


A287T 


AF 


gggacggtcggtagatCTCAGGGAGGGAGAGAGA 


1055 


A287T 


EF 


GGGACAGACAGACAGACA 


1055 


A287T 


ER 


GACXaATGCCTTOVGCACACAACTCCTTCTTC^^ 


1055 


A287T 


TF 


gctggctcggtcaagaCTCAGGGAGGGAGAGAGT 


1056 


A354G 


AR 


gggacggtcggtagatGCGGCTGCCCCGTCCTGT 


1056 


A3 546 


EF 


GAOSATGCCTTCAGCACAGTGTGTCTATGTGTCl^^ 


1056 


A354G 


ER 


CGGACTTCTCCTl'CrTGT 


1056 


A354G 


GR 


gctggctcggtcaagaGOGGCTGCCCCGTCCTGC 


1085 


A251G 


EF 


TAGGGTAAGCAGCAAGAG 


1085 


A251G 


ER 


CACAAGGCAAGAGATAACA 


1085 


A251G 


AF 


gggacggtcggtagatCAGGCAAGATAGACAGCA 


1085 


A251G 


GF 


gctggctcggtcaagaCAGGCAAGATAGACAGCG 


1086 


A104G 


EF 


GTGCCCATACX^AACAGAATAG 


1086 


A104G 


ER 


TGCCAA6TACCCCAAGAG 


1086 


A104G 


AR 


gggacggtcggtagatCCATTCCTCCCCAGACAT 


1086 


?V104G 


GR 


gctggctcggtcaagaCCATTCCTCCCCAGACAC 


1092 


C16876 


CF 


gggacggtcggtagatCGTGCX3AGCAGCX3AAAGC 


1092 


C:i6876 


EF 


CGAGAGAGAAGTCGAGGAAGAGA 


1092 ( 


:i687G 


ER 


GACGATGCCTTCAGCACAGTCACCCCCAAAAGCAGG 


1092 ( 


:i687G 


GF 


^ctggctcggtcaagaOGTGOGAGCAGCGAAAGG 


1096 ( 


?454T 


EF 


3A0GATGCCTTC»GCACACTTTTCCrCC^^ 


1096 ( 


;454T 


ER ; 


MUnt5ATG13UlCCCTCXrrCTC 


1096 ( 


M54T 


GR i 


sggacggtcggtagatTCAGCTATAAAXAGGGCC 


1096 C 


^4547 


TR < 


JctggctcggtcaagaTCAGCTATAAATAGGGCA 


1101 < 


:249T 


CR < 


jggacsfgtcggtagatTGATGGCX^GGTGCCAAGG 


1101 c 


:249T 


EF C 


SAOGATGCCTTCAGCACAGCTCTTTCCTl"^ 
1101


C249T 


ER 


CACTGGGGGTCCTCTTAC 


1101 


C249T 


TR 


gc t ggc t cggt caagaTGATGGCGGGTGCCAAGA 


1204 


A307G 


AR 


gggacggtcggtagatCAAGGGCACTCACATTAT 


1204 


A307G 


EF 


GAOSATGCCTTCAGCACAGCTCTTGCGTCTGTTTCC 


1204 


A307G 


ER 


TTTCCCTTCTGTCCCCTT 


1204 


A307G 


GR 


gc tggc t cggt caagaCAAGGGCACTCACATTAC 


1504 


C180T 


CF 


gggacggtcggta^atGTGACTTTTGGTTCCCAC 


1504 


C180T 


EF 


AACTCGGGGTCACTGGTCT 


1504 


C180T 


ER 


GACGATGCCTTCA6CACACAGCGGGTATGGAGGATG 


1504 


C180T 


TF , 


gctggctcggtcaagaGTGACTTTTGGTTCCCAT 


1511 


G153T 


EF 


ACACCAGTTCTCCCTCCT 


1511 


G153T 


ER 


GACGATGCCTTCAGCACACOCACCTTTCCTAATCCT 


1511 


G153T 


GF 


gggacggtcggtagatTTGGGACTCTGOGTCAAG 


1511 


G153T 


TF 


gctggctcggtcaagaTTGGGACTCTGCGTCAAT 


1524 


A284C 


AF 


gggacggtcggtagatCTCTCAAAGCCCACACAA 


1524 


A284C 


CF 


gctggctcggtcaagaCTCTCAAAGCCCACACAC 


1524 


A284C 


EF 


AGAAAAAGAAAAGGAAAAAGA 


1524 


A284C 


ER 


GACGATGCCTTCAGCACAGGAAAGTTACAAGGCTATGA 


1556 


C367G 


CR 


gggacggtcggtagatACCTGCCTCTAAGGTCTG 


1556 


C367G 


EF 


GACGATGCCTTCAGCACAAGGAGAAGACAGTTCAAGG 


1556 


C367G 


ER 


ACAGTTGCCAGAGAAAAG 


1556 


C367G 


GR 


gctggctcggtcaagaACCTGCCTCTAAGGTCTC 


1561 


A251C 


EF 


TCACTTGCCTCTACTCCA 


1561 


^51C 


ER 


ATACCAGAAAGACTAAGCTCC 


1561 


^51C 


AF 


gggacggtcggtagatGGGTGAGCTCTGTGGGCA 


1561 


^51C 


CF 


gctggctcggtcaagaGGGTGAGCTCTGTGGGCC 


1582 


C389T 


CR 


gggacggtcggtagatCCAAGGGTTATGGCAGGG 






EF < 


3ACGATGCClTCiu3CACACCIGACTATTTGGGGTTGTG 


1582 ( 


:389T 


ER 1 


^TOGCTCTCTGCTTCTGCT 


1582 .( 


:3e9T 


TR < 


jctggctcggtcaagaCCAAGGGTTATGGCAGGA 


1638 2 


V443G 


AR < 


jggacggtcggtagatCCAAAACCCCAGCXSCTGT 


1638 J 


U43G 


EF ( 


SAOSATGCCTTCAGCACACTCTTTATCCTGCTTATGGT 


1638 ; 


U43G 


ER ( 


:CAAGCTCACTCT6TAGG 


1638 ; 


1443G 


GR c 


IctggctcggtcaagaCCAAAACCCCAGCGCTGC 
1662


C251T 


EF 


AATACAATGGAAGCCAAG 


1662 


C251T 


ER 


CCTAATCGAACAGAAAGG 


1662 


C251T 


CF 


gggacggtcggtagatCCAGTCTCCATCCACTTC 


1662 


C251T. 


TP 


gctggctcggtcaagaCCAGTCTCCATCCACTTT 


1714 


A376G 


AF 


gggacggtcggtagatTGAACGGCATGACGGGGA 


1714 


A376G 


EF 


AAGTGTTTCTGCTGTGCCT 


1714 


A376G 


ER 


GACGATGCCTTCAGCACACAAGTCCTGGTTTTCCATC 


1714 


A376G 


GF 


gctggctcggtcaagaTGAACGGCATGACGGGGG 


1722 


C89T 


CF 


gggacggtcggtagatACCCCAGGATGCCCACAC 


1722 


C89T 


EF 


GTITATCCTCCTCATGTCC 


1722 


C89T 


ER 


GACGATGCCTTCAGCACAGTTACCTTTTCCACCTCTC 


1722 


C89T 


TF 


gctggctcggtcaagaACCCCAGGATGCCCACAT 


1757 


A210G 


AF 


gggacggtcggtagatGGAAACAAACCAAAATGA 


1757 


A210G 


EF 


CCAGCACCOVAAATAAGA 


1757 


A210G 


ER 


GACGATGCCTTCAGCACAATAAGTTGAAGCCCTCCC 


1757 


A2106 


GF 


gc t ggc t cggt caagaGGAAACAAACCAAAATGG 


1765 


A240G 


AF 


SggacggtcggtagatGGCTTCACGGAGGAAGAA 


1765 


b^40G 


EF 


TTAGGAGCTGTGAGGTATG 


1765 


A240G 


ER 


6ACGATGCCTTCAGCACATAAGATGGAGCAGGGTAG 


1765 


A240G 


GF 


gctggctcggtcaagaGGCTTCACGGAGGAAGAG 


. 1776 


A200G 


AF 


gggacggtcggtagatAAAGGGCTCCCAACACCA 


1776 


A200G 


EF 


TGAGCACAAGATCAGAGAGG 


1776 


;^00G 


ER 


GACGATGCCTTCAGCACAAGSCAGAGACGCAGGAATG 


1776 


A200G 


GF 


gctggctcggtcaagaAAAGGGCTCCCAACACCG 


1799 


C370T 


CF 


gggacggtcggtagatAGGGACAACCAAAGTGAC 


1799 


C370T 


EF 


^TCATCAGAACAGCCCTAC 


1799 


C370T 


ER 


SACGATGCCTTCAGCACACAAGCCCACCTACTTACTC 


1799 


C370T 


TF 


?ctggctcggtcaagaAGGGACAACCAAAGTGAT 


1806 i 


PaOlG 


AF i 


jggacggtcggtagatTGGGCGTCCTGGTGGGCA 


1806 1 


\201G 


EF • 


rCTTOQGGCTAACTCTTT 


1806 J 


\201G 


ER ( 


SACGATGCCTTCAGCACACTGKTlCTCCAAACCTTCr 


1806 J 


^OIG 


GF < 


jctggctcggtcaagaTGGGOGTCCTGGTGGGCG 


1837 ( 


MIST 


CF c 


?ggacggtcggtagatCTCAGCTTCAT6CAGGGC 


1837 C 


:413T 


EF ( 


:CCACTCAGCCCTGCTCTT 
1837


C413T 


ER 


GACGATGCCTTCAGCACAGCATCCTTGGCGGTCTTC 


1837 


C413T 


TF 


gc t ggc t cggt caagaCTCAGCTTCATGCAGGGT 


1870 


C323T 


CF 


gggacggtcggtagatCTCCTCATTGCCTCCTTC 


1870 


,C323T 


EF 


CACCTCTTTTCTCCTTCTCTT 


1870 


C323T 


ER - 


GACGATGCCTTCAGCACACCCACCCCCTCTATCTAC 


1870 


C323T 


TF 


gctggctcggtcaagaCTCCTCATTGCCTCCTTT 


1882 


CllST 


CR 


gg9acggtcggtagatGTCCCCCACAAGTCCTCG 


1882 


C115T 


EF 


GAOGATGCCTTCAGCACAGACCTGTACCCTTTACCC 


1882 . 


CllST 


ER 


TGTTTCCCTGTCTGTTTC 


1882 


CllST 


TR 


gctggctcggtcaagaGTCCCCCACAAGTCCTCA 


1988 


C214T 


CF 


gggacggtcggtagatGTGACTCGGTCCTATACC 


1988 


C214T 


EF 


GTGGGCTGTGATT6TCTT 


1988 


C214T 


ER 


GACGATGCCTTCAGCACATCTCGTCGTCGTAGTAGTTGT 


1988 


C214T 


TF 


gctggctcggtcaagaGTGACTCGGTCCTATACT 


2000 


C349T 


CR 


gggacggtcggtagatAGTATGGTAATTAGGAAG 


2000 


C349T 


EF 


GACGATGCCTTCAGCACACTGACACTGAGCCACAAC 


2000 


C349T 


ER 


AACTGATGAGCAAGAAGGA 


2000 


C349T 


TR 


gctggctcggtcaagaAGTATGGTAATTAGGAAA 


2071 


A338G 


AR 


gggacggtccgtagatAAAATTGTTTCCTGTGAT 


2071 


A338G 


EF 


GACGATGCCTTCAGCACACATTGCTATTCTCAGGCTATA 


2071 


A3386 


ER 


CCCATTCTCT6CTTGACAGT 


2071 


A3386 


GR 


gctggctcggtcaagaAAAATTGTTTCCTGTGAC 


2078 


G876T 


EF 


CCAGAGAGGGGATAAAGA 


2078 


G876T 


ER 


GACGATGCCTTCAGCACAGAGTGTCAAGAGGAACAGG 


2078 


G876T 


GF 


gggacggtcggtagatTGGCTGCTGAGGTCTGAG 


2078 


G876T 


TF 


gctggctcggtcaagaTGGCTGCTGAGGTCTGAT 


2085 


G415T 


EF 


SCTTTTtCTTTTCATTACATC 


2085 


G415T 


ER 


SAOSATGCCTTCAGCACACCTCTTTTAGAATCAGAGACA 


2085 


S415T 


GF i 


?99LacggtcggtagatGGTA6T6TTACCAGAAA6 


2085 


841ST 


TF < 


?Ctggctcggtcaaga6GTAGT6TTACCAGAAAT 


2095 


M06G 


AR ( 


srggacggtcggtagatTGTGCACCGGGATATTTT 


2095 


^406G 


EF ( 




2095 


^406G 


ER C 


3GTGTTTCTCCTCCTCTCT 


2095 I 


^406G 


GR c 


jctggctcggtcaagaTGTGCACCGGGATATTTC 
2119


k676 


AR- 


gggacggtcggtagatGTGGGCACCAAACGCTAT 


2119 . 


bv67G 


EF 


GACGATGCCTTCAGCACAGATGTAGGGCTGGAAGTG 


2119 


U67G 


ER 


TCAAGAAAAATGGGAGTTG 


2119 


UiSlG . 


GR 


gctggctcggtcaagaGTGGGCACCAAACGCTAC 


2141 


pVl 7 6G 


EF 


TGTAGCATCGGTAGGTTC 


2141 


kl76G 


ER 


CAACATCAGAC'ilTCnTTTC 


2141 


A176G 


AR 


gggacggtcggtagatTGGTACAGGGCTAGTTTT 


2141 


Ul76G 


GR 


gctggctcggtcaagaTGGTACAGGGCTAGTTTC 


2182 


A318G 


AF 


gggacggtcggtagatAGGCGGGCCAAGGGTGAA 


2182 


A318G 


EF 


ITTCTCTCTCCCCTTCTGT 


2182 


U3I8G 


ER 


GAOSATGCCTTCAGCACATAAATGTTCACTCTTCTT^ 


2182 


k318G 


GF 


gctggctcggtcaagaAGGCGGGCCAAGGGTGAG 


2234 


G296T 


EF 


GGGTTGTTGCAGGGCGCTATT 


2234 


|g296T 


ER 


GACGATGCCTTCAGCACATGTGGAGAGGCCGQGTGC | 


2234 


G296T 


GF 


gggacggtcggtagatGAACCAGCCCCCTGGAAG ] 


2234 


G296T 


TF 


gctggctcggtcaagaGAACCAGCCCCCTGGAAT 


2281 


U227C 


AR 


gggacggtcggtagatCAGGCTTGGAGACCTGGT 


2281 


k227C 


CR 


gc tggc t cggt caagaCAGGCTTGGAGACCTGGG 


2281 


k227C 


EF 


GACGATGCCTTCAGCACAGGGTATTCAGTTGGAAGG 


2281 


[a227C 


ER 




AGGCAAGGTTCTTAGTTG 


2298 


A77C 


AR 


gggacggtcggtagatTCTAAAAGCACTTGAAAT 1 


2298 


A77C 


CR 


gctggctcggtcaagaTCTAAAAGCACTTGAAAG 


2298 


A77C 


EF 


GACGATGCCTTCAGCACACCTGCTAGTGTTTTCTGG 


.2298 


A77C 


ER 


TGTAACTGATAGGTGGTGG 1 


2341 


C286T 


CR 


gggacggtccgtagatTGAAGATTCTGCTCASCG 


2341 


C286T 


EF 


GACGATGCCTTCAGCACAAGGGCCOGGGACTCAT 


2341 


C286T 


ER 


TTTGGGGTCCTGOGGATG "1 


2341 • 1 




TR 


gctggctcggtcaagaTGAAGATTCTGCTCAGCA 


2357 L 
2357 L 


^165G 


AF 


gggacggtcggtagatCAAAGAAGACGAAAATGA ] 




M65G 


EF 


CTCAA5TTT6TTACTGATTTCTC "| 


2357 L 
2357 L 


U65G 


ER 


GACGATGCCTTCAGCACAGGGTTACGTCTGCTCTTC • 




U65G 


GF 


gctggctcggtcaagaCAAAGAAGT^CGAAAATGG | 


2366 C 


350T 


EF 


GACGATGCCTTCAGCACACTGCTCCGAAACACGGTC 


2366 C 


350T 


ER 




lATCTTCAGCCCTTCTTACTCT 1 
2366


G50T 


GR 


gggacggtcggtagatCTCCTGGGCACCAOGGGC 


2366 


G50T 


TR 


Igc t ggc t egg t caagaCTCCTGGGCACCACGGGA 


2995 


A299C 


£R 


gacgatgccttcagcacaTGGGATTAGACACGAGAG 


2995 


A299C 


EF 


kAAGAACTGGAAGAAGGAA ' " 


2995 


A299C 


AF 


gggacggtcggtagatGTCACCTCCTTTCCACTA 


2995 


A299C 


CP 


Igc t ggc t egg t caagaGTCACCTCCTTTCCACTC 


3360 


G777T 


£R 


IgacgatgccttcagcacaAGAAAAATGAGAGGGAAAAC 


3360 


G777T 


EF 


GATGAAGGGAAATGGAAC 


3360 


G777T 


OF 


gggacggtcggtagatCCAACTATATAGGAGCCG 


33€0 


G777T 


•TF 


gctggctcggtcaagaCCAACTATATAGGAGCCT - 


3464 


AllOG 


EF 


CTX3AACCGAGGAGATTTTT 


3464 


MlOG 


ER 


TGATGCTTACAGAACTGGG 


3464 


AllOG 


AF 


gggacggtcggtagatGTGTAGTGGGCAGGGTTA 


3464 


AllOG 


GF 


gctggctcggtcaagaGTGTAGTGGGCAGGGTTO 


3975 


A65C 


EF 


gacgatgccttcagcacaAAAAGAACCCTGGTGAAG ' 


3975 


A65C 


ER 


CCCTGATAAAAGAGATGGA 


3975 


A65C 


AR. 


gggacggtcggtagatCGCATGGGAGTCAGGGAT 


3975 


A65C 


CR 1 


gctggctcggtcaagaCGCATGGGAGTCAGGGAG 


3976 . 


A239G 


-EF 


gacgatgccttcagcacaATGAGGGAGCAAGACAAG 


3976 


A239G 


ER 


TGATAAAAGAGAT6GAAGGAG 


3976 


A239G 


AR 1 


gggacggtcggtagatGTCACTGTTTGTCACTGT 


3976 


A239G 


GR 1 


gctggctcggtcaagaGTCACTGTTTGTCACTGC 


4206 


A304T 


EF 


gacgatgccttcagcacaCTTTTTAGCCAAGTGGAG • 


4206 


A304T 


ER \{ 


GGATCTGAGGAATCTGTG 


4206 


^04T 


AR IgggacggtcggtagatACCAGGCAGAGAGAAAAT 


4206 


!^04T 


TR gctggctcggtcaagaACCAGGCAGAGAGAAAAA 


4912 


^74G 


EF CITCACTGAGCX3TCCXXAGAG 


4912 


\74G 


BR CCX5TC6GCCCX3ATTCA 


4912 } 


V74G 


AR CAGGCGAGCCTCAGCCCT 


4912 J 


^746 


GR ( 


::aggcgagcctcagcccc 


4925 ; 


V251C 


EF n 


PCATTTCCCAATTXACXTCC | 


4925 ; 


V251C 


ER C 


:crCTTTCCCATCTCCCT 


4925 ; 


^51C 


AF c 


rggacggtcggtagatAGCCAGGAGCCTGCGTCA 


4925 ; 


L2.S1C 


CP c 


rctggctcggtcaagaAGCCAGGAGCCTGCGTCC . | 
4966


A2S1G 


EF 


CATTGCTCTTCCTCTCTGT . " 


4966 


A251G 


ER 


GTGTCATCATTCCTTTCTTG 


4966 


A251G 


AR 


gggacggtcggtagatTCAGAGACATGAGTCCAT 


4966 


A251G 


GR 


gctggctcggtcaagaTCAGAGACATGAGTCCAC 


5014 


A2057G 


\ ER 


gacgatgccttcagcacaCACCTGTCCCACCCTATTT 


5014 


A2057G 


EF 


GTCCTGAACGCCCATTCT 


5014 


A20S7G 


AF 


gggacggtcggtagatGCCTGCTVCTGCGTTCCTA 


5014 


A2057G 


GF 


gctggctcggtcaagaGCCTGCACTGCGTTCCTG 


5296 


A251G 


EF 


GCTCCTCTGCCTTCTGCTl' 


5296 


A251G 


ER 


ATTTGCCCACTGCCCTTC 


5296 


A251G 


AF 


gggacggtcgsftagatTGGCTGCAGGTGGGTCCA 


5296 


A251G 


GF 


gctggctcggtcaagaOX^GCTGCAGGTGGGTCCG 


5298 


C172T 


EF 


GCCACACACACXTTTAACA 


5298 


C172T 


ER 


AAAGTTCTCTGCCTCCAA 


5298 


C172T 


CP 


gggacggtcggtagatAGCTCTCAGCTGGGGTGC 


5298 


C172T 


. TF 


gctggctcggtcaagaAGCTCTCAGCTGGGGTGT 


5457 


A134G 


EF 


AGCAGAATG6GCAATAGA 


5457 


A134G 


ER 


AGAGATGTGGGCAGAGAA 


5457 


A134G 


AF 


gggacggtcggtagatGGAAAGCCTACTTTCTTA 


5457 


A134G 


GF 


gctggctcggtcaagaGGAAAGCCTACTTTCTTG 


5704 


C61T 


EF 


ACAGCCATAACAGGAGTG 


5704 


C61T 


ER 


GGGTTACTCAACCrAAGAGA 


5704 


C61T 


CR 


gggacggtcggtagatGTTCTCTTTGGGAAAACG 


5704 


C61T 


TR 


gctggctcggtcaagaGTTCTCTTTGGGAAAACA 


5717 


jaseoG 


EF 


gacgatgccttcagcacaGAACAGAAACCACAGAACC 


5717 


M.960G 


ER 


6TCCCACCCTATTTTGAG 


5717 


M.960G 


AR 


gggacggtcggtagatCACTGGCCCACCTCCCTT 


5717 


ra960G 


GR 


gctggctcggtcaagaCACTGGCCCACCTCCCTC 


5959 


^710 


EF 


gacgatgccttcagcacaACCATGCCTGACTXAACC 


5959 i 


^71G 


ER ' 


rTGTTTCCTGTCCTCTTTC 


5959 i 


i71G 


AR i 


jggacggtcggtagatGTTAAGAGGCTOGGCACT 


5959 I 


171G 


GR < 


JctggctcggtcaagaGTTAAGAGGCTGGGCAGC 


6162 C 


:340G 


EF < 


jacgat'gccttcagcacaAGTGTTGTTAGGAGCAAAG 


6162 ( 


:340G 


ER ( 


nTAGGAAACTGAGGTGG 
6162


C340G 


CR 


gggacggtcggtagatCTGCAGCCTGGGCAACAG 


6162 


C340G 


•GR 


gctggctcggtcaagaCTGCAGCCTGGGCAACAC 


6236 


C906T 


ER 


gacgatgccttcagcacaTGGACACATTTGAGCTTT 


6236 


C906T 


EF 


CTTCCCCAGAGATGACTAC 


6236 


C906T 


CF 


gggacggtcggtagatCCCCATCCTACTCAGCAC 


6236 


C906T 


TF 


gctggctcggtcaagaCCCCATCCTACTCAGCAT 


6744 


C348T 


ER 


gacgatgccttcagcacaGGTTACAGTGAGCCAAGA 


6744 


C348T 


EF 


AGGTGAAOAAAGCAAAATAC 


6744 


C348T 


CF 


gggacggtcggtagatTGGTGTGTGTTTTGTTTC 


6744 


C348T 


TF 


gctggctcggtcaagaTGGTGTGTGTTTTGTTTT 


7133 


C636 


EF 


TTGAGACCCTACAGASCCA . 


7133 


C63G 


ER 


GGCAAGCTGAGGTGAAAG 


7133 


C63G 


CR 


999acggtcggtagatAATAAGGTAA6AAATGAG 


7133 


C63G - 


GR 


gctggctcggtcaagaAATAAGGTAAGAAATGAC 


8210 


A251G 


EF 


TAATTTCTAATGGCCTTCC 


8210 


A251G 


ER 


TCACTTACTCCCTGATGTCT 


8210 


A251G 


AR 


gggacggt cggt agat CATTGGGTTTTCCCTCAT 


8210 


A251G 


GR. 


gc t ggc t cggt caagaCATTGGGTTTTCCCTCAC 


8592 


C46T 


- ER 


gacgatgccttcagcacaACATTTAGTGCCAACATCAC 


859.2 


C46T 


EF 


CTCTTCCCTGAGACACCA 


8592 


C46T 


. CF 


gggacggtcggtagatGAAGGTGAAGGCCAGAGC 


8592 


C46T 


TF 


gctggctcggtcaagaGAAGGTGAAGGCCAGAGT 


8943 


A251C 


EF 


GAGGCTGAGACAGAAGAA .. v 


8943 


A251C 


ER 


GTTTGACATTAAAGAAAATGAG 


6943 


A251C 


AR 


gggacggtcggtagatGGCTGGAGTGCAGTGATT 


8943 


h2SlC 


CR 


gctggctcggtcaagaGGCTGGAGTGCAGTGATG 


9193 


C88G 


EF 


CACGCTGTTGAGTGGG 


9193 


C88G 


ER 


OGCAGGTCTACGCnXA 


9193 


C88G 


CR 


399acggtC9gtagatCCC6GGTCT6AGGCT6CG 


9193 


C88G 


GR < 


jctggctcggtcaagaCCCGGGTCTGAGGCTGCC 


9516 


M.87G 


EF ( 


::acacacacacacacacac 


9516 


M87G 


ER ( 


3GTCCCTTACTTTCCTCTT 


9516 . i 


WL87G 


AR ( 


jggacggtcggtagatCCTATCCCTACTTCCCCT 


9516 i 


a87G 


GR c 


jctggctcggtcaagaCCTATCCCTACTTCCCCC 
151G



EF GTGACCCCAAAAGAGAGA 



9698 



ISIG 



ER CTAGCTTGTTACTGCCTCC 



9698 
9698 



>51G 



AF gggacggtcggtagatGGCACGACCCCGCCCCCA 



iSlG 



GF gctggctcggtcaagaGGCACGACCCCGCCCCCG 



9883 



24 9G 



EF TCCACAACCTCAAAACCAC 



9883 
9883 



249G 



ER CACAGTCCTGCAAGCTCA 



149G 



AR gggacggtcggtagatCCGTGGCCGTGGCTCACT 



9883 



M9G 



GR IgctggctcggtcaagaCCGTGGCCGTGGCTCACC 



10481 



L07T 



ER gacgatgccttcagcacaGTTCGGGGCTCCACTT 



10481 
10481 



L07T 



EF TAGCX3GGACAG06CT6 



L07T 



AF gggacggtcggtagatCCCGGCGCGCCTOGGAGA 



10481 
10542 



a07T 



TF gctggctcggtcaagaCCCGGCGCGCCTOGGAGT 



C367T 



EF gacgatgccttcagcacaAATACACTGGGTCCTGCT 



10542 



10542 
10542 



C367T 



ER 



VTAOTGCTGGCCrrrCTC 



C367T 



CR gggacggtcggtagatGGTCAGGGGAGCCCAGAG 



C367T 



TR gctggctcggtcaagaGGTCAGGGGAGCCCAGAA 



10600 



ISIG 



EF CCTGGCAACTAACCTCTT 



10600 



10600 
10600 



ISIG 



ER 



rrCTCTGTCTACTC 



251G 



AR gggacggtcggtagatATTGGCCCTGCTCAGGAT 



J51G 



GR gctggctcggtcaagaATTGGCCCTGCTCAGGAC 



10621 



10621 
10621 



C402T 



EF CCAGCCCTAAACCTAAA 



C402T 



ER 



^CCTCTCAAGATCAGACAC 



C402T 



CF gggacggtcggtagatTTAGCACTTAATAAGTAC 



10621 



C402T 



TF gctggctcggtcaagaTTAGCACTTAATAAGTAT 



10745 
10745 



251G 



EF CCCCACAACAAAGAAAGA 



251G 



ER GAA6CCAACTCTCCAACA 



10745 
10745 



251G 



AF 



gggacggtcggta^atCAAGGATTTCAAAAACCA 



^51G 



GF gctggctcggtcaagaCAAGGATTTCAAAAACOG 



10771 



C64G 



IgacgatgccttcagcacaCCAGGGAAGAGCAGAACC 



10771 
10771 



C64G 



ER TGTACGG6AAGAGGCAGA 



C64G 



CR 



gggacggtcggtapatAGGGTGACACAGGCCAOG 



10771 



10870 
10870 



C64G 



GR gctggctcggtcaagaAGGGTGACACAGGCCACC 



ISIG 



EF ^TCCCATCCCAACACACA 



251G 



ER CCGAGACCAAACTCATTCAC 
10870


pi251G 


AR 


gggacggtcggtagatGGCAGAGCCTGAGTCACT 


IXU'O /U 


|A2516 


GR 


gctggctcggtcaagaGGCAGAGCCTGAGTCACC 


IXUo / / 


iiv o c 1 ^ 
|A251C 


EF 


CCT6TTTCTCAACCTTCTC 


ll 00*7*7 


p^51C 


ER 


ATGGTCTATGGAACCTAATCT 




Ilk O P ^ ^ 

|A251C 


AF 


gggacggtcggtagatGCACTGATTCTGCTTCCA 


I1Uq77 


|A251C 


CP 


gctggctcggtcaagaGCACTGATTCTGCTTCCC 


M AOJI CI 

I1094D 


.1614 OT 


EF 


AAGGACA666TCAGGAAAG 


1 1 #\ Ail a * 


|G140T 


ER 


CA6AG6GAGGAAGGAGGT 


110948 


1614 OT 


GF 


gggacggtcggtagatATGGAGGAGGGTGTCTGG 


1^ A A il A 

110948 


1614 OT 


TF 


gctggctcggtcaagaATGGAGGAGGGTGTCTGT 


11001 


C286T 


EF 


gacgatgccttcagcacaTTCCCAAAGACCCACA 


11001 


C286T 


ER 


CCTCCACCGCTATCAC " 


11001 


C286T 


CR 


gggacggt cggtagatTGGCTGCAGGACGTrCCAG 


11001 


C286T 


TR 


gctggctcggtcaagaTGGCTGCAGGACGTCCAA 


11001 


C286T 


EF 


TTCCCAAAGACCCACA 


11001 


C286T 


ER 


CCTCCACCGCTATCAC 


11001 C286T 


CR 


gggacggtcggtagatTGGCTGCAGGACGTCCAG 


11001 


C286T 


TR 


Z 7 ^ — 

gctggctcggtcaagaTGGCTGCAGGACGTCCAA 


11073 


C2156 


EF 


CCCAACCACCOGTTCC 


11073 


C21SG 


ER 


6CGC6GGAGCTAGAGA 


11073 


C215G 


*.CF 


gggacggtcggtagatGAAGCTGCGGGCCGGACC 


11073 


C2156 


GF 


gctggctcggtcaagaGAAGCTGCGGGCCGGACG 


11153 


C116T 


EF 


CGAGTGGGAAGAAAAGTAGA 


11153 


C116T 


ER 


ATGACTGCCTGCCTAGAA 


11153 


|cil6T 


CR 


?99acggtcg9tagatAAGATAGGGTAGAGGCC6 


11153 


jcil6T 


TR < 


JctggctcggtcaagaAAGATAGGGTAGAGGCCA 


11210 


C194T 


EF ( 


3AGGA6TGAGGGAAAGTAAG 


11210 


C194T 


ER I 


kAATGGAGAGAfiA<!n3fiG& 


11210 


C194T 


CF c 


jggacggtcggtagatCCAGGAAATGACATGATC 


11210 


C194T 


TF c 


jctggctcggtcaagaCCAGGAAATGACATGATT ' * 


11248 


C225T 


EF 1 


X5A6TTGAACAGCACTTGG 


11248 


C225T 


ER I 


^6GGTAAGGGAGGGAAAA 


11248 


C225T 


CR c 


iggacgigtcggtagatTGATTCTTTCGCTTGGCG 


11248 |C225T 


"TR^i 


[ctggctcggtcaagaTGATTCTTTOGCTTGGCA 



wo 2004/018709 



PCT/EP2003/009126 






11372     A251G   EF    TAGAAAAGAAGAAAAATCAA
11372     A251G   ER    ACACACACACACACACAC
11372     A251G   AR    gggacggtcggtagatCATCACCTTTTAGTTTCT
11372     A251G   GR    gctggctcggtcaagaCATCACCTTTTAGTTTCC
11449     C251G   EF    ACAGAAGAACAACAACAAAAC
11449     C251G   ER    TGCGTATGAGGTAAAGAGA
11449     C251G   CF    gggacggtcggtagatATGAGTGAAGCCTGTCTC
11449     C251G   GF    gctggctcggtcaagaATGAGTGAAGCCTGTCTG
11450     A251T   EF    ACAGAAGAACAACAACAAAAC
11450     A251T   ER    TGCGTATGAGGTAAAGAGA
11450     A251T   AR    gggacggtcggtagatGGACCATAATCTTGAAGT
11450     A251T   TR    gctggctcggtcaagaGGACCATAATCTTGAAGA
11470     C251T   EF    GCntviCTTOTCTGATAGGTG
11470     C251T   ER    CAACGTGAGAATTTCCAAAAT
11470     C251T   CR    gggacggtcggtagatTGAGAATTTCCAAAATAG
11470     C251T   TR    gctggctcggtcaagaTGAGAATTTCCAAAATAA
11472     A251T   EF    TACATTCAAGGCAAGAAAA
11472     A251T   ER    TGATTAGTTACAATTACCTCTAGTATC
11472     A251T   AF    gggacggtcggtagatAGTTTGTCAGTAAATCTA
11472     A251T   TF    gctggctcggtcaagaAGTTTGTCAGTAAATGTT
11487     A485T   EF    gacgatgccttcagcacaAGAGAGCAGCTAGACTGAGA
11487     A485T   ER    TTGCTGCAAACAGTTGAG
11487     A485T   AR    gggacggtcggtagatAGTTGAGGGCTCAGGATT
11487     A485T   TR    gctggctcggtcaagaAGTTGAGGGCTCAGGATA
11488     C533G   EF    gacgatgccttcagcacaAGAGAGCAGCTAGACTGAGA
11488     C533G   ER    GTAAATAAAATGGGATGGTG
11488     C533G   CR    gggacggtcggtagatGCCCCAGCAAGCTGGATG
11488     C533G   GR    gctggctcggtcaagaGCCCCAGCAAGCTGCATC
11493     A171G   EF    CCTTTTGTGTTTTGTTTTGT
11493     A171G   ER    CTTCTCCACCtTCCATTC
11493     A171G   AF    gggacggtcggtagatGGGAACTCCTAAATCAAA
11493     A171G   GF    gctggctcggtcaagaGGGAACTCCTAAATCAAG
11502     C455T   EF    gacgatgccttcagcacaACGATGGGGTCAGAGTCA
11502     C455T   ER    :CTACATTTCACACACGAACA
11502     C455T   CR    gggacggtcggtagatACACACTCCTCTCTCAAG
11502     C455T   TR    gctggctcggtcaagaACACACTCCTCTCTCAAA


11534     G258T   EF    GCCATCGTCTTTCCCT
11534     G258T   ER    TCCTCCCTCCTTCTCTCT
11534     G258T   GR    gggacggtcggtagatCCTCCACCCACCAGGGCC
11534     G258T   TR    gctggctcggtcaagaCCTCCACCCACCAGGGCA
11537     A251G   EF    CCTCl-i-A^CTCuTCCTCTTC
11537     A251G   ER    CTCTTCCTGTCTTCTCGTCT
11537     A251G   AF    gggacggtcggtagatAGATGGACCTCTACAGGA
11537     A251G   GF    gctggctcggtcaagaAGATGGACCTCTACAGGG
11560     A185G   EF    CTCCTCCAACTCCTTTAC
11560     A185G   ER    ATACTTCTCACTGCATCCT
11560     A185G   AR    gggacggtcggtagatCCTGTCCCCTCCCTAGTT
11560     A185G   GR    gctggctcggtcaagaCCTGTCCCCTCCCTAGTC
11594     C251T   EF    CACCTTCCTGAACTCACTC
11594     C251T   ER    TGATGTCTGTGCTGTCCT
11594     C251T   CR    gggacggtcggtagatTCTGGTCCACTCAAGGAG
11594     C251T   TR    gctggctcggtcaagaTCTGGTCCACTCAAGGAA
11624     C251T   EF    TCGGGAGGTGTAAGTAAG
11624     C251T   ER    CCACAGTCAGAAGAGACAA
11624     C251T   CR    gggacggtcggtagatAGAGACCCTGGTCCCAAG
11624     C251T   TR    gctggctcggtcaagaAGAGACCCTGGTCCCAAA
11627     C251T   EF    TTTATCACTACACCCCCTACTC
11627     C251T   ER    GACAGACCGACCAATCAC
11627     C251T   CR    gggacggtcggtagatCCCTGGGAAGGTTGAGAG
11627     C251T   TR    gctggctcggtcaagaCCCTGGGAAGGTTGAGAA
11650     A146G   EF    CTGTCTGTTTGGGTCTTC
11650     A146G   ER    1 IXJU. CTGl'CCACT
11650     A146G   AR    gggacggtcggtagatGGCCAAATGTCTAAAAGT
11650     A146G   GR    gctggctcggtcaagaGGCCAAATGTCTAAAAGC
11654     A251G   EF    CGTATCTCTTGCCTTTCTT
11654     A251G   ER    CTTCTCTTATGCCTTCCC
11654     A251G   AF    gggacggtcggtagatTTACTTGAAAGGACACCA
11654     A251G   GF    gctggctcggtcaagaTTACTTGAAAGGACACCG
11655     A251C   EF    CGTATCTCTTGCCTTTCTT
11655     A251C   ER    CTTCTCTTATGCCTTCCC
11655     A251C   AF    gggacggtcggtagatTTCTGCACTAAAGCTGTA
11655     A251C   CF    gctggctcggtcaagaTTCTGCACTAAAGCTGTC
11656     C251T   EF    TGGGAAGAAAAAGAGAAG
11656     C251T   ER    GTTGAAACACTGCACAAG
11656     C251T   CR    gggacggtcggtagatCAGGGCTGTTGGGTGAAG
11656     C251T   TR    gctggctcggtcaagaCAGGGCTGTTGGGTGAAA


11825     A277G   ER    gacgatgccttcagcacaTGAATAGACAGGGACGAA
11825     A277G   EF    GACCTTGGAAATAATGGAG
11825     A277G   AF    gggacggtcggtagatCAACCCAGCAAAAATGGA
11825     A277G   GF    gctggctcggtcaagaCAACCCAGCAAAAATGGG
11914     A246T   EF    gacgatgccttcagcacaTTGGAAGTGAGATAAGATAGGT
11914     A246T   ER    ACGGTGAGAATGAGAGGT
11914     A246T   AR    gggacggtcggtagatJUJUlCAGACAIt::^^
11914     A246T   TR    gctggctcggtcaagaAAAACAGACArCAGAGGA
12097     A411G   ER    gacgatgccttcagcacaGATGAAACCCTGTCTCTACT
12097     A411G   EF    TTATCAACCTTAGTCTCCCT
12097     A411G   AF    gggacggtcggtagatACCTGCCACCACACCCAA
12097     A411G   GF    gctggctcggtcaagaACCTGCCACCACACCCAG
12366     A412G   ER    gacgatgccttcagcacaGCTGATGTGGTTGTGAG
12366     A412G   EF    GTTCCTGTAGCTCGTGTAG
12366     A412G   AF    gggacggtcggtagatCTCCCCGCCCTGCAGCAA
12366     A412G   GF    gctggctcggtcaagaCTCCCCGCCCTGCAGCAG
12619     ^5G     ER    gacgatgccttcagcacaTGGCTGGACTTTGACTGATA
12619     ^5G     EF    rCTTGTTTGTGTCACAGTGC
12619     Pi2SG   AF    gggacggtcggtagatTGTGTCACAGTGCTCTGA
12619     ^5G     GF    gctggctcggtcaagaTGTGTCACAGTGCTCTGG
13025     I1585C  EF    gacgatgccttcagcacaTTTAAGTAACATGACAAACTC
13025     \585C   ER    ^TCXGATAACTGAGCAGG
13025     1585C   AR    gggacggtcggtagatCTATTAAGTAACTGGTGT
13025     1585C   CR    gctggctcggtcaagaCTATTAAGTAACTGGTGG
13191     A504G   ER    gacgatgccttcagcacaATTCTCCCATTTCTCCTGT
13191     A504G   EF    rcCCTCTTCTCCTCATTC
13191     A504G   AF    gggacggtcggtagatCCCTAATGTCTTCCTCTGA
13191     A504G   GF    gctggctcggtcaagaCCCTAATGTCTTCCTCTGG
900045    C116T   EF    ATCTCCTGATCCAAGTCC
900045    C116T   ER    CACACTGTGCCCATCTAC
900045    C116T   CR    gggacggtcggtagatCTGACTGATTACCTCATG
900045    C116T   TR    gctggctcggtcaagaCTGACTGATTACCTCATA


900078    A251G   EF    CATAG6TAAASATCTGTAGGTG
900078    A251G   ER    CCACCTTGGAAGTTGGCAAA
900078    A251G   AR    gggacggtcggtagatattaaatcgcctctctcT
900078    A251G   GR    gctggctcggtcaagaattaaatcgcctctctcC
900107    C426T   ER    gacgatgccttcagcacaAGGGCTTTTTCAGGTAGA
900107    C426T   EF    GACCTTTCCTGGGTAGAA
900107    C426T   CF    gggacggtcggtagatACTCTGAACCTGGGGGAC
900107    C426T   TF    gctggctcggtcaagaACTCTGAACCTGGGGGAT
10000002  A103G   AF    gggacggtcggtagatGATCAACACAATCTTCAA
10000002  A103G   EF    CAGCTGATkAGAGATGAAATTTACT
10000002  A103G   ER    GACGATGCCTTCAGCACAAACTTATGAAGATTAAG^
10000002  A103G   GF    gctggctcggtcaagaGATCAACACAATCTTCAG
10000006  G107A   AF    gctggctcggtcaagaGGGCTGGGCTGCTAGGGA
10000006  G107A   EF    AGACGAGTTCAAGGTGACTG
10000006  G107A   ER    GACGATGCCTTCAGCACACCAAGTTTCCGAGTTTCC
10000006  G107A   GF    gggacggtcggtagatGGGCTGGGCTGCTAGGGG
10000014  A153C   AF    gggacggtcggtagatGTACCAATACATCCTGCA
10000014  A153C   CF    gctggctcggtcaagaGTACCAATACATCCTGCC
10000014  A153C   EF    ntSCTGATGTCTCTGTTG
10000014  A153C   ER    GACGATGCCTTCAGCACAGACTTACTTTGCTCACACTT
10000025  C291T   CF    gggacggtcggtagatCCTCACTTCCTCAACGCC
10000025  C291T   EF    :CTCTCTGTCTG6'rrATCTTG
10000025  C291T   ER    GACGATGCCTTCAGCACAAGTGTGCCTCCTGGTTAG
10000025  C291T   TF    gctggctcggtcaagaCCTCACTTCCTCAACGCT
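
The allele-specific primers listed above appear in pairs that carry two different 16-base lower-case 5' tags and otherwise differ only in their 3'-terminal base. The following Python sketch checks that pattern for one pair. The function names `split_tag` and `check_allele_pair` are illustrative assumptions and not part of the described method; the two tag sequences are simply read off the table.

```python
# Universal 5' tags as they appear on the allele-specific primers in the table.
TAG_1 = "gggacggtcggtagat"
TAG_2 = "gctggctcggtcaaga"


def split_tag(primer):
    """Return (tag, template_specific_part) for a tagged allele-specific primer."""
    lowered = primer.lower()
    for tag in (TAG_1, TAG_2):
        if lowered.startswith(tag):
            return tag, primer[len(tag):].upper()
    raise ValueError(f"no known 5' tag on primer: {primer}")


def check_allele_pair(primer_a, primer_b):
    """Check that two allele-specific primers differ only in tag and 3' base.

    Returns the two 3'-terminal bases, in the order the primers were given.
    """
    tag_a, body_a = split_tag(primer_a)
    tag_b, body_b = split_tag(primer_b)
    if tag_a == tag_b:
        raise ValueError("both primers carry the same tag")
    if body_a[:-1] != body_b[:-1]:
        raise ValueError("template-specific parts differ before the 3' base")
    if body_a[-1] == body_b[-1]:
        raise ValueError("primers do not differ at the 3' base")
    return body_a[-1], body_b[-1]


# baySNP 11073 (C215G), primers CF and GF as listed in the table.
cf = "gggacggtcggtagatGAAGCTGCGGGCCGGACC"
gf = "gctggctcggtcaagaGAAGCTGCGGGCCGGACG"
print(check_allele_pair(cf, gf))  # ('C', 'G')
```

A check of this kind is also a convenient way to spot transcription errors: a pair whose template-specific parts disagree before the last base raises an error instead of passing silently.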
Table 2b Oligonucleotide primers used for genotyping using Pyrosequencing

The baySNP number refers to an internal numbering of the PA SNPs. Primer
sequences are listed for preamplification of the genomic fragments and for
sequencing of the SNP using the pyrosequencing method. Bio: Biotinylated
Oligonucleotide.



BaySNP    Name       Sequence
2995      Primer F   GCCAAGACTAGGAAGTAAGTGT
2995      Primer R   Bio-CCCAGAACCACAAAGCTAGTAA
2995      Seq.       TGCCCTGGTCACCTCCnrCC
3689      Primer F   Bio-CTGACCCTGACCTTCATACTCAA
3689      Primer R   AGAAGAT^GAAGCCTCTCTACAGTT
3689      Seq.       AACAGATCAGGTTGGTG
4838      Primer F   Bio-CAAAGATGACCTTATGGCTCTGA
4838      Primer R   GTCTCGGAACATGACCTTTAGT
4838      Seq.       TGACTAAGAATGTAATGGGGAAGA
6498      Primer F   CTTltS'lXKaATCTTTCTGCGGTGT
6498      Primer R   Bio-CCATGTTGAGGAGCCCAGAGTGA
6498      Seq.       ATTACAGTTGTGAGATTGTGC
8021      Primer F   GGCCTTCTATGTACTAGGCG
8021      Primer R   Bio-CTCTTTCTGGAGGCATCAATC
8021      Seq.       CACAGGGAGACCCC
8060      Primer F   Bio-GCCTTATTTTCCACTCCCACCT
8060      Primer R   TACCTTTCCCCATCCCAACTG
8060      Seq.       TCAGCATATGTTTGGATT
8846      Primer F   MTTGAGAGAAGGTAGGGT
8846      Primer R   Bio-TTTGTTACTCTGTAGCCA
8846      Seq.       'VAATATTCAGTAACTTGTTT
9849      Primer F   CAG CAA TCG AAT CCC TT
9849      Primer R   rCT TGT TGT TTG OCT AGC TCC
9849      Seq.       CCT GCC TTA CTG AGA GCC AAA
10079     Primer F   Bio-CACGCCAATTCCCACCATCCT
10079     Primer R   C5TCCGTC(!lAGGGGnAATr?TnTTT
10079     Seq.       AATGTGTTTCTTGGGGGT
10747     Primer F   CTAACCATCTTCCAAATGCTTAATP
10747     Primer R   Bio-TCCTTGAGTCTGIkGTTTCCC
10747     Seq.       CACAAGAAACICCTGAAA


11578     Primer F   CTC GGC GTG PTT Gf3T AIXT 2L&
11578     Primer R   CGiG AGC CGA ACT .CTG- GAG GAA TPT .
11578     Seq.       GGC TGG PJVA GTT GTT PP21 TPP nnr«
11644     Primer F   TGA GPA GPG PAT . PPT
11644     Primer R   TGP AGP PP2V PTV3 iPT PJia
11644     Seq.       GPT GTT APT PAG TAT GAT
12008     Primer F   COGAAGAPPAAGAPGP
12008     Primer R   Bio-TCTTCCATAAAAAPAAGGPTP
12008     Seq.       AAACAAQAAATTPTGTTTA
13937     Primer F   K5A CAG OTP CCA TTG GAA
13937     Primer R   AAT TAA TGC GAT CCC TC
13937     Seq.       GAC AGC TCC CAT TGG AAG
900002    Primer F   ATTGGGCAGGGATAAfiAGAAAAG
900002    Primer R   Bio-GATGAATCACAGAATGCGGTAT
900002    Seq.       CACACAGCAGTTCACGCA
900013    Primer F   GCCAAGACTAGGAAGTAAGTGT
900013    Primer R   Bio-CCCAGAACCACAAAGCTAGTAA
900013    Seq.       TGCCCTGGTCACCTCCTTTGC
900025    Primer F   Bio-AGTGGCTCACTTGCTAACG
900025    Primer R   CTGGGGAAGAAAATAAATGAA
900025    Seq.       CtrrGCrrCTTAGGATACACGT
900032    Primer F   W5CGTCTTCACCATCTQCT
900032    Primer R   Bio-GGGAA0GAGGAAGCCAAACA
900032    Seq.       ^CATGTCTGATGATACCTGG






900045    Primer F   Bio-GCCATGCACGATTTCCC
900045    Primer R   CAGTGTGCCCATCTACGAG
900045    Seq.       GGACCTGACTGATTACCT
900065    Primer F   GAGTAGCTAGGATCACAGGTGCGT
900065    Primer R   Bio-TGTTCGAGATTTAAGAAAGTTGGC
900065    Seq.       CAGGTGCGTGCCACCATGCCC
900082    Primer F   CAC ACA ATT TTC CAC TTA
900082    Primer R   GAG TCC ACT TTT CTA TCA
900082    Seq.       ATG TTG ATG TAA TCT ACT
900096    Primer F   TGGGGCAAGCAACAGTGGT
900096    Primer R   Bio-TAGGCAGGGCAAGGGATTAGG
900096    Seq.       TTTAAATTCTCTGACAGAGAC
900107    Primer F   Bio-GCCy^CCAGCCCACACTCTGAACCTG
900107    Primer R   CCATCAGCCTTCACCCACGTGCCA
900107    Seq.       GCCTCAGCTTGACCT
900115    Primer F   Bio-GGTAAGTGCGTGCCTOGGAGATGC
900115    Primer R   CX^GGTGGGGAGGACAGAGC
900115    Seq.       5AGGACA6AGCAAAAGGAT
900121    Primer F   Bio-TGCCTTACAATATACSATGG
900121    Primer R   ;AAT66GTAAG6AGTAAAGTT
900121    Seq.       rrcxaiGCTGcrTTTA
Table 2c

Oligonucleotide primers used for genotyping using Restriction Fragment
Length Polymorphism (RFLP)

The baySNP number refers to an internal numbering of the PA SNPs. Primer
sequences are listed for preamplification of the genomic fragments. The restriction
enzyme used for RFLP is indicated.

baySNP    Name       Sequence                   Enzyme
900173    Primer F   GAACAAACCTCCGAGATGCTAC     Hind III
900173    Primer R   GTCTTATGTTACTGGGCTTTCACC   Hind III
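
In an RFLP assay of this kind, the two alleles are told apart by whether the amplified fragment still contains the recognition site of the restriction enzyme. The Python sketch below illustrates the idea for Hind III, whose recognition sequence is AAGCTT (cut after the first A). The two 40-bp amplicons are hypothetical and are not sequences from this document, and the function name `fragment_lengths` is an illustrative assumption.

```python
HINDIII_SITE = "AAGCTT"  # Hind III recognition sequence, cut as A^AGCTT


def fragment_lengths(amplicon, site=HINDIII_SITE, cut_offset=1):
    """Return fragment lengths after complete digestion of a linear amplicon.

    The site is palindromic, so scanning one strand is sufficient.
    """
    seq = amplicon.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


# Hypothetical amplicons: one allele keeps the site, the other destroys it.
allele_with_site = "GGATCCGTACGTTAGCAAGCTTGACCTGATCGATCGGTAC"
allele_without_site = "GGATCCGTACGTTAGCAAGCTCGACCTGATCGATCGGTAC"

print(fragment_lengths(allele_with_site))     # [17, 23]
print(fragment_lengths(allele_without_site))  # [40]
```

On a gel, a homozygote for the cut allele would show only the two short fragments, a homozygote for the uncut allele only the full-length band, and a heterozygote all three.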
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Fam- 

CCCACCTCCCTCCTAGCCTCAGTT- 
Tamra 




Vic - COAATGTGgCTGCCCAGCC - 
TAMRA 

AACAGATCTGGTCTaCCT 

CCCACCTGOaQAAT 

TGAQAAAAAAGgTTCCG 

TGCTCAGGAtAGCC 

AGGAAGcGTGGCCT 

CGCCCAOTAATaCAGA 

TCGTTCCAcTGGACGT 

TCGGOGCTgGTC 

CTTGOCgTCGCOTC 

TTGAAAGGACACCaTATT 

CACTAAAGCTGTaATATTA 

TCTTCCTCTGgGTAACA 






6462 

8060 

8816 

10600 

10771 

10948 

11001 

11073 

11248 

11654 

11655 

13191 

Human acidic 82 kDa protein mRNA, complete cds. 


Human T-lymphoma invasion and metastasis inducing TIAM1
protein (TIAM1) mRNA, complete cds.


Human T-lymphoma invasion and metastasis inducing TIAM1
protein (TIAM1) mRNA, complete cds.


Human T-lymphoma invasion and metastasis inducing TIAM1
protein (TIAM1) mRNA, complete cds.


H.sapiens gene for mitochondrial ATP synthase c subunit (P1 form)


U15552


HS162961


Homo sapiens B94 protein mRNA, complete cds.


Homo sapiens (clones: CW52-2, CW27-6, CW15-2, CW26-5, 11-67)
collagen type VII intergenic region and (COL7A1) gene, complete
cds.


Homo sapiens catechol-O-methyltransferase (COMT) mRNA,
complete cds.


H.sapiens mRNA for D1075-like gene


Human coagulation factor V mRNA, complete cds.


Homo sapiens ITGB3 gene, intron 2, fragment C, partial sequence.


H.sapiens hexokinase I (MK-16)


H.sapiens mRNA for NEFA protein 


Human nerve growth factor receptor mRNA, complete cds. 
Homo sapiens MSH55 gene, partial cds; and CLIC1, DDAH, G6b,
G6c, G5b, G6d, G6e, G6f, BAT5, G5b, CSK2B, BAT4, G4, Apo M,
BAT3, BAT2, AIF-1, 1C7, LST-1, LTB, TNF, and LTA genes,
complete cds.


Homo sapiens (clones lambda gMHC 1,2,3 and 4) beta-myosin heavy
chain (MYH7) gene, complete cds.


Human flavin-containing monooxygenase (FMO1) mRNA, complete
cds.


Human flavin-containing monooxygenase (FMO1) mRNA, complete
cds.


Homo sapiens methionine adenosyltransferase alpha subunit gene
fragment.


H.sapiens YAP65 mRNA


Human coagulation factor VII (F7) gene exon 1 and factor X (F10)
gene, exon 1.


Human apolipoprotein A-IV gene, complete cds. 


CALCIUM-TRANSPORTING ATPASE PLASMA MEMBRANE, 
ISOFORMS 3A/3B (EC 3.6.1.38) (CALCIUM PUMP) (PMCA3). 
Human cytochrome P450c17 (steroid 17-alpha-hydroxylase/17,20
lyase) mRNA, complete cds.


Human cytochrome P450c17 (steroid 17-alpha-hydroxylase/17,20
lyase) mRNA, complete cds.


Homo sapiens vascular endothelial growth factor mRNA, complete
cds.


H.sapiens angiotensinogen gene 5' region and exon 1


Homo sapiens DNA sequence from PAC 329E20 on chromosome
1p34.4-36.13. Contains endothelin-converting-enzyme 1 (ECE-1),
EST, STS, CA repeat


Homo sapiens PAC clone RP1-102K2 from 22q12.1-qter, complete
sequence.


Homo sapiens chromosome 19, cosmid R31449, complete sequence.


Homo sapiens axin (AXIN) mRNA, partial cds.


Homo sapiens WNT1 inducible signaling pathway protein 1 (WISP1)
gene, promoter and partial cds.


Homo sapiens cadherin-13 mRNA, complete cds.


Human cytochrome P450 (CYP1A2) gene, exons 1 and 2.
Homo sapiens lipoprotein lipase precursor, gene, partial cds.


Homo sapiens sterol 12-alpha hydroxylase CYP8B1 (Cyp8b1)
mRNA, partial cds.


Human tissue factor gene, complete cds.


Human coagulation factor IX gene, complete cds.


Human DOCK180 protein mRNA, complete cds.


Homo sapiens mRNA for diacylglycerol kinase delta, complete cds.


Human calmodulin mRNA, complete cds.


Human Na,K-ATPase subunit alpha 2 (ATP1A2) gene, complete cds.


Human Na,K-ATPase subunit alpha 2 (ATP1A2) gene, complete cds.


Homo sapiens cathepsin B mRNA, 3' UTR with a stem-loop structure
providing mRNA stability.


Human mRNA for OB-cadherin-2, complete cds.


Homo sapiens endothelial cell protein C receptor precursor (EPCR)
gene, complete cds.


Human apolipoprotein A-I and C-III genes, complete cds.


Human mRNA for lipoprotein apoAI Human apolipoprotein A-I and
C-III genes, complete cds.
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Human apolipoprotein A-I and C-III genes, complete cds. 


Human retinoid X receptor beta (RXR-beta) mRNA, complete cds. 


Human keratinocyte lectin 14 (HKL-14) mRNA, complete cds.


H.sapiens mRNA for colligin (a collagen-binding protein)


NADH-UBIQUINONE OXIDOREDUCTASE CHAIN 5 (EC 
1.6.5.3). 
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Human c-kit gene. 


H.sapiens mRNA for ATL-derived factor/thioredoxin.


H.sapiens mRNA for activin beta-C chain


Homo sapiens laminin-related protein (LamA3) mRNA, complete
cds.


H.sapiens ALK-3 mRNA.


H.sapiens ALK-3 mRNA.


Homo sapiens mRNA for platelet glycoprotein VI, complete cds.


Homo sapiens mRNA for platelet glycoprotein VI, complete cds.


Human gene for thrombopoietin, exon1-exon6, complete cds.
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Claims 

1. An isolated polynucleotide encoded by a phenotype associated (PA) gene; the
polynucleotide is selected from the group comprising
SEQ ID 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21,
22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42,
43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63,
64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84,
85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103,
104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118,
119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133,
134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148,
149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163,
164, 165, 166, 167, 168 with allelic variation as indicated in the sequences
section contained in a functional surrounding like full length cDNA for PA
gene polypeptide and with or without the PA gene promoter sequence.

2. An expression vector containing one or more of the polynucleotides of
claim 1.

3. A host cell containing the expression vector of claim 2.

4. A substantially purified PA gene polypeptide encoded by a polynucleotide of
claim 1.

5. A method for producing a PA gene polypeptide, wherein the method
comprises the following steps:

a) culturing the host cell of claim 3 under conditions suitable for the
expression of the PA gene polypeptide; and

b) recovering the PA gene polypeptide from the host cell culture.
6. A method for the detection of a polynucleotide of claim 1 or a PA gene
polypeptide of claim 4 comprising the steps of:

contacting a biological sample with a reagent which specifically interacts with
the polynucleotide or the PA gene polypeptide.

7. A method of screening for agents which regulate the activity of a PA gene
comprising the steps of:

contacting a test compound with a PA gene polypeptide encoded by any
polynucleotide of claim 1; and detecting PA gene activity of the polypeptide,
wherein a test compound which increases the PA gene polypeptide activity is
identified as a potential therapeutic agent for increasing the activity of the PA
gene polypeptide and wherein a test compound which decreases the PA
activity of the polypeptide is identified as a potential therapeutic agent for
decreasing the activity of the PA gene polypeptide.

8. A reagent that modulates the activity of a PA polypeptide or a polynucleotide
wherein said reagent is identified by the method of claim 7.

9. A pharmaceutical composition, comprising:

the expression vector of claim 2 or the reagent of claim 8, and a
pharmaceutically acceptable carrier.

10. Use of the reagent according to claim 8 for the preparation of a medicament.

11. A method for determining whether a human subject has, or is at risk of
developing a cardiovascular disease, comprising determining the identity of
nucleotide variations as indicated in the sequences section of SEQ ID 1-168
of the PA gene locus of the subject and where the SNP class of the SNP is
"CVD" as can be seen from table 3; whereas a "risk" genotype has a risk ratio
of greater than 1 as can be seen from table 6.
12. A method for determining a patient's individual response to statin therapy,
including drug efficacy and adverse drug reactions, comprising determining
the identity of nucleotide variations as indicated in the sequences section of
SEQ ID 1-168 of the PA gene locus of the subject and where the SNP class
of the SNP is "ADR", "EFF" or both as can be seen from table 3; whereas the
probability for such response can be seen from table 6.

13. Use of the method according to claim 12 for the preparation of a medicament
tailored to suit a patient's individual response to statin therapy.

14. A kit for assessing cardiovascular status or statin response, said kit
comprising

a) sequence determination primers and

b) sequence determination reagents

wherein said primers are selected from the group comprising primers that
hybridize to polymorphic positions in human PA genes according to claim 1;
and primers that hybridize immediately adjacent to polymorphic positions in
human PA genes according to claim 1.

15. A kit as defined in claims 12 detecting a combination of two or more, up to
all, polymorphic sites selected from the groups of sequences as defined in
claim 1.

16. A kit for assessing cardiovascular status or statin response, said kit
comprising one or more antibodies specific for a polymorphic position
defined in claim 1 within the human PA gene polypeptides and combinations
of any of the foregoing.
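
Claim 11 characterises a "risk" genotype by a risk ratio greater than 1. How table 6 derives its risk ratios is not stated in this section, so the following Python sketch assumes the standard epidemiological definition: the proportion of affected subjects among carriers of a genotype divided by the proportion among non-carriers. The function name `risk_ratio` and all counts are hypothetical.

```python
def risk_ratio(cases_with, controls_with, cases_without, controls_without):
    """Risk ratio of a genotype from a 2x2 table of subject counts.

    Assumes the standard definition: risk among carriers of the genotype
    divided by risk among non-carriers.
    """
    risk_with = cases_with / (cases_with + controls_with)
    risk_without = cases_without / (cases_without + controls_without)
    return risk_with / risk_without


# Hypothetical counts: 40 of 100 carriers and 20 of 100 non-carriers affected.
rr = risk_ratio(cases_with=40, controls_with=60,
                cases_without=20, controls_without=80)
is_risk_genotype = rr > 1  # criterion used in claim 11
print(rr, is_risk_genotype)
```

With these illustrative counts the carriers are affected twice as often as the non-carriers, so the genotype would count as a "risk" genotype under the criterion of claim 11.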



